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Abstract 

Studying  the  social  and  cultural  transmission  of  behavior  among  animals  helps  to 
identify  patterns  of  interaction  and  information  content  flowing  between  individuals.  Killer 
whales  are  likely  to  acquire  traits  culturally  based  on  their  population-specific  feeding 
behaviors  and  group-distinctive  vocal  repertoires.  I  used  digital  tags  to  explore  the 
contributions  of  individual  Norwegian  killer  whales  to  group  carousel  feeding  and  the 
relationships  between  vocal  and  non-vocal  activity7. 

Periods  of  tail  slapping  to  incapacitate  herring  during  feeding  were  characterized  by 
elevated  movement  variability,  heightened  vocal  activity  and  call  types  containing  additional 
orientation  cues.  Tail  slaps  produced  by  tagged  animals  were  identified  using  a  rapid  pitch 
change  and  occurred  primarily  within  20m  of  the  surface.  Two  simultaneously  tagged 
animals  maneuvered  similarly  when  tail  slapping  within  60s  of  one  another,  indicating  that 
the  position  and  composition  of  the  herring  ball  influenced  their  behavior. 

Two  types  of  behavioral  sequence  preceding  the  tight  circling  of  carousel  feeding 
were  apparent.  First,  the  animals  engaged  in  periods  of  directional  swimming.  They  were 
silent  in  2  of  3  instances,  suggesting  they  may  have  located  other  foraging  groups  by 
eavesdropping.  Second,  tagged  animals  made  broad  horizontal  loops  as  they  dove  in  a 
manner  consistent  with  corralling.  All  4  of  these  occasions  were  accompanied  by  vocal 
activity,  indicating  that  this  and  tail  slapping  may  benefit  from  social  communication.  No 
significant  relationship  between  the  call  types  and  the  actual  movement  measurements  was 
found. 


Killer  whale  vocalizations  traditionally  have  been  classified  into  discrete  call  types. 
Using  human  speech  processing  techniques,  1  considered  that  calls  are  alternatively 
comprised  of  shared  segments  that  can  be  recombined  to  form  the  stereotyped  and  variable 
repertoire.  In  a  classification  experiment,  the  characterization  of  calls  using  the  whole  call,  a 
set  of  unshared  segments,  or  a  set  of  shared  segments  yielded  equivalent  performance.  The 
shared  segments  required  less  information  to  parse  the  same  vocalizations,  suggesting  a  more 
parsimonious  system  of  representation. 

This  closer  examination  of  the  movements  and  vocalizations  of  Norwegian  killer 
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whales,  combined  with  future  work  on  ontogeny  and  transmission,  will  inform  our 
understanding  of  whether  and  how  culture  plays  a  role  in  achieving  populadon-specific 
behaviors  in  this  species. 
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bottom  by  sequentially  adding  both  low  and  high  frequency  components  to  the  N16.1 
base.  .  137 


15 
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shown  in  the  third  row.  Here,  the  segments  of  each  call  were  honored  with  a  polynomial 
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were  categorized  to  type  based  on  the  entire  call  type  (upper  left),  unshared  segments 
(bottom  row)  or  a  combination  of  shared  and  call-specific  segments  (upper  right).  The 
unshared  segments  were  scored  stricdy  (i.e.,  each  segment  within  a  call  type  had  to  match 
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Figure  5.9.  Rarefacdon  curves  plotdng  the  number  of  call  types  (top  left)  and  shared 
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and  viewed  for  all  data  (left)  and  zoomed  region  (right,  magnified  to  box  depicted  on 

left).  . 156 

Figure  5.10.  Sampling  of  matches  of  alternative  calls  (dark  black  traces)  with  Norwegian 
stereotyped  call  syllable  matches  (gray  traces).  The  alternative  calls  included  Norwegian 
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segments  (third  column)  that  performed  at  threshold  percentages  (see  Table  5.3)  of  90%, 
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Figure  5.11.  a:  Syllables  found  in  at  least  two  call  types.  Note  different  axis  scales,  b:  Color 
code  for  syllable  traces  of  different  call  type  origins  (legend  locations  here  match  trace 
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Figure  A4.1.  Narwhal  shown  with  Crittercam  (contained  within  the  dashed  ellipse)  and 
DTAG  (contained  within  the  dashed  rectangle)  attached  immediatelv  before  release. 
Photograph  courtesy  of  Rune  Dietz.  . 253 


Figure  A4.2.  Sample  spectrograms  (larger,  top  plots)  and  waveforms  (smaller,  bottom 
plots)  of  a  combined  tonal/pulsed  signal  produced  by  individual  (a)  mm224  and  (b) 
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most  of  the  pulses  is  likely  due  to  the  resonance  of  the  air  sacs  involved  in  sound 
production  or  transmission.  The  solid  arrows  in  both  spectrograms  indicate  the 
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Figure  A4.3.  Spectrogram  composite  of  all  four  whistles  of  mm224  (a-d)  and  14  of  the  17 
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Figure  A4.4.  Digitized  traces  of  the  fundamental  frequency  contours  of  the  whistles 
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indicated  by  filled  circles  (•)  while  the  other  spectral  components  are  marked  with 
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Chapter  1.  General  Introduction 


1.1  Social  learning  and  animal  culture 

The  social  intelligence  hypothesis  proposes  that  social  environments  have  placed  a 
premium  on  learning  and  the  cognitive  and  behavioral  adaptability  associated  therein,  which 
may  have  led  to  the  development  of  intelligence  more  generally.  Whiten  &  Byrne  (1988) 
asserted  that  observational  learning  may  have  become  more  prevalent  as  social  intelligence 
became  increasingly  developed.  Individuals  belonging  to  social  species  must  interact  with 
other  dynamic  agents  to  form  relationships  with  predictable  patterns  of  affiliative, 
manipulative  or  aggressive  behaviors  (Byrne  &  Whiten,  1988;  Whiten  &  Byrne,  1997).  Social 
transmission  of  behavior  can  occur  vertically  between  parents  and  offspring,  horizontally 
between  members  of  the  same  generation  and  obliquely  between  individuals  of  different 
generations  (Cavalli-Sforza  &  Feldman,  1981).  Originally  documented  with  regard  to 
primates,  evidence  now  indicates  that  numerous  species  of  fishes  exhibit  behaviors 
consistent  with  social  intelligence  (see  Bshary  et  al.,  2002  for  a  review)  and  it  is  reasonable  to 
suppose  that  other  taxa  are  similarly  capable.  Social  learning  may  be  influenced  bv  both  pre¬ 
existing  social  dynamics  (Coussi-Korbel  &  Fragaszy,  1995)  and  the  cognitive  capacity 
determining  the  information  content  that  can  be  transmitted  successfully  (Byrne  et  al.,  2004). 
Because  learners  must  approach  others  closely  to  observe  and  model  their  behaviors, 
relationships  between  kin  or  bonded  animals  often  provide  the  gregariousness  and  tolerance 
that  afford  the  necessary  proximity  (Coussi-Korbel  &  Fragaszy,  1995;  van  Schaik  et  al., 

1999).  A  theoretical  continuum  of  instruction  ranges  from  drawing  attention  to  and 
indirectly  encouraging  interaction  with  certain  features  of  the  environment  to  the  active 
instruction  and  shaping  of  behavior  more  directly  (Caro  &  Hauser,  1992;  Boesch  & 
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Tomasello,  1998). 

Social  learning  is  fundamental  to  cultural  transmission,  which  refers  to  the 
dissemination  of  behaviors  via  imprinting,  conditioning,  imitation,  facilitation,  teaching  and 
local  or  stimulus  enhancement  (Zentall,  2006).  Culture  and  cultural  transmission  are  terms 
first  used  by  social  anthropologists  for  human  beings  exclusively.  When,  in  the  1970s, 
biological  anthropologists  first  introduced  evidence  to  support  the  idea  of  animal  culture 
(e.g.,  Kummer,  1971),  it  seemed  paradoxical  and  even  heretical.  Culture,  they  argued,  was 
evident  when  communities  of  animals  were  distinguishable  based  on  a  characteristic  set  of 
behaviors  where  genetic  (i.e.,  heritable)  and  ecological  or  environmental  explanations  were 
unsupported  (see  de  Waal,  1999;  Boesch,  2003).  The  notion  of  animal  culture  has  come  to 
help  frame  arguments  both  about  how  animals  learn  and  transmit  information,  underscoring 
the  ways  in  which  organisms  and  the  cultures  they  adopt  shape  one  another,  and  about 
conservation  (Laiolo  &  Jovani,  2006).  As  a  process,  cultural  transmission  of  a  trait 
commences  when  a  new  behavior  called  an  innovation  is  introduced  and  subsequently 
diffuses  through  all  or  part  of  the  population  as  increasing  numbers  of  individuals  learn  the 
behavior  from  one  another  (Laland  &  Hoppitt,  2003).  I  next  turn  to  research  that  has 
addressed  the  social  and  cultural  transmission  of  behavior  in  the  wild  and  in  captivity.  Then, 
1  address  killer  whales  ( Orcinus  orca)  as  an  ideal  species  for  exploring  foraging  traditions  and 
population-specific  vocal  dialects,  which  both  show  evidence  for  being  acquired  via  social 
learning. 

1.1.1  Foraging  behavior 

The  earliest  work  on  animal  culture  and  social  transmission  focused  primarily  on  the 
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tool  use  and  foraging  specializations  of  highly  social  non-human  primates.  One  of  the 
earliest  examples  documented  the  spreading  of  sweet  potato  washing  behavior  developed 
originally  by  an  18-month-old  Japanese  macaque  ( Macaca  fuscata)  to  other  members  of  her 
social  group  (Imanishi,  1957).  The  slow  rate  of  spreading  of  this  behavior,  however,  was 
thought  by  some  to  be  inconsistent  with  cultural  diffusion  as  a  transmission  mechanism 
(reviewed  in  Whiten,  2000).  Recendy,  Whiten  et  al.  (1999)  collated  39  separate  behaviors 
that  characterized  certain  chimpanzee  {Pan  troglodytes)  communities  but  not  others  across 
seven  field  sites  (Whiten  et  al.,  1999).  Many  of  these  behaviors  related  to  population-specific 
foraging  activities.  The  authors  asserted  that  these  behaviors  were  culturally  transmitted  but 
recent  work  has  suggested  that  the  discrediting  of  ecological  explanations  may  have  been 
premature.  Humle  &  Matsuzawa  (2002)  demonstrated  that  the  differences  in  ant-dipping 
foraging  techniques  by  different  chimpanzee  populations  -  the  example  emphasized  by 
Whiten  et  al.  (1999)  —  could  be  explained  by  the  aggressiveness  of  the  ant  species  rather  than 
by  cultural  inheritance.  The  same  criticism  of  failing  to  take  environmental  or  phvsical 
features  into  account  may  be  levied  against  the  assertion  of  culture  in  orangutans  ( Pongo 
pygmaeus)  by  van  Schaik  et  al.  (2003)  who  used  a  similar  compilation  and  comparison 
approach  to  the  Whiten  et  al.  (1999)  study. 

Stronger  evidence  for  foraging  culture  has  emerged  from  research  conducted  on 
chimpanzees  in  captivity.  The  general  experimental  paradigm  has  involved  seeding  a 
different  feeding-related  behavioral  sequence  that  achieves  the  same  objective  in  two 
populations  and  observing  whether  and  how  these  behaviors  propagate  within  the 
communities.  This  approach  is  modeled  after  the  observation  that  wild  chimpanzee 
populations  often  display  unique  sequences  of  behavior  to  achieve  an  identical  goal  (Whiten 
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&  Boesch,  2001;  Whiten,  2005).  Although  the  collective  results  of  these  experiments  have 
been  ambiguous  (e.g.,  Horner  et  al.,  2006;  Hopper  et  al.,  2007),  a  more  general  point  is  that 
species  that  lend  themselves  to  experimental  manipulation  with  appropriate  genetic  and 
environmental  controls  can  provide  more  compelling  evidence  for  animal  culture  (Laland  & 
Hoppitt,  2003).  Within  the  foraging  domain,  for  example,  feeding  preferences  among 
captive  Norwegian  rats  were  transmitted  culturally  (Rattus  norvegicus ,  Galef  &  Allen,  1995). 
Other  examples  include  the  social  transmission  of  migration  routes  among  wild  French 
grunts  (Haemulon  jlavolineatum ,  Helfman  &  Schultz,  1984),  mating  site  preferences  among  wild 
bluehead  wrasse  ( Thalassoma  bifasciatum ,  Warner,  1988)  and  affiliative  and  competitive 
behaviors  among  captive  cowbirds  ( Molothrus  ater ,  White  et  al.,  2007).  These  species  tend  to 
be  more  tractable  for  experimental  manipulation,  observation  and  interpolation  whereas  the 
lack  of  experimental  controls  has  caused  the  ape  work  to  remain  largely  inconclusive. 

1.1.2  Vocal  behavior 

Another  suite  of  evidence  for  social  learning  and  animal  culture  concerns  vocal 
behavior.  Vocal  production  learning  occurs  when  an  animal  modifies  its  acoustic  signals  due 
to  experience  with  other  individuals  to  render  these  signals  either  more  or  less  similar  to  the 
model  that  it  hears  (Janik  &  Slater,  1997;  Janik  &  Slater,  2000).  It  is  a  social  process  that  can 
lead  to  the  transmission  of  an  acoustic  repertoire  between  signalers  and  receivers.  Songbirds 
are  an  excellent  illustration  of  this  phenomenon  as  they  generally  learn  species-  and  often 
population-specific  song  through  an  iterated  procedure  of  listening  and  vocalizing  (Slater, 
1986).  Similarly,  in  the  case  of  male  humpback  whales  ( Megaptera  novaeangliae )  whose  song 
converges  and  evolves  simultaneously  among  all  individuals  within  a  population  (Payne  et  al., 
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1984),  intra-  and  intersexual  selection  on  vocal  behavior  may  be  relevant  if  females  compare 
males  based  on  song  quality  and  fidelity  as  research  suggests  (Tyack,  1999). 

Tyack  &  Sayigh  (1997:  230)  reported,  “Vocal  learning  may  provide  a  mechanism 
whereby  the  vocal  repertoire  can  develop  to  match  the  particular  social  system  experienced 
by  an  individual.”  Compared  with  vocal  learning,  vocal  culture  underscores  the  stable  social 
bonds  that  allow  information  to  be  transmitted  between  conspecifics  and  across  generations. 
A  focus  on  vocal  culture  emphasizes  the  ways  in  which  the  social  relationships  allow  learning 
to  occur.  Vocal  culture  can  cause  acoustic  badges  of  membership  to  emerge  among  certain 
animal  groupings.  For  example,  the  different  contact  calls  of  newly  formed  groups  of 
captive  male  budgerigars  ( Melopsittacus  undulatus)  converged  on  the  same  dominant  call,  which 
then  experienced  synchronous  changes  (Farabaugh  et  al.,  1994).  Called  conformity  bias,  this 
phenomenon  plays  a  role  more  generally  in  integrating  individuals  into  social  groups  (see 
Boesch  et  al.,  1994;  Sapolsky  &  Share,  2004;  Whiten  et  al.,  2005  for  additional  examples).  In 
addition,  though  other  explanations  including  genetics  have  not  yet  been  excluded,  Rendell 
&  Whitehead  (2003)  proposed  that  coda  variation  among  sperm  whales  ( Physeter 
macrocephalus )  may  be  best  explained  by  cultural  transmission  that  follows  association  patterns 
among  social  units  spread  over  an  entire  ocean  basin. 

1.2  Killer  whales  as  candidates  for  using  culture 

To  summarize  the  earlier  sections,  primates  offer  suggestive  but  inconclusive 
evidence  for  animal  culture.  Data  from  more  tractable  species  including  fishes  and  rats  have 
been  of  better  quality  but  of  somewhat  limited  scope.  The  cultural  aspects  of  vocal  learning 
exemplify  the  ways  in  which  social  bonds  influence  and  allow  the  transmission  of  vocal 
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behavior.  An  understanding  of  the  consequences  of  cultural  transmission  for  social  behavior 
requires  a  broad  taxonomic  comparison  of  animals  in  the  wild.  It  is  important  to  ascertain 
whether,  why  and  how  these  phenomena  are  deployed  by  a  variety  of  species  in  their  natural 
habitats.  If  genedc  inheritance  can  be  discredited,  group-disdncdve  behaviors  provide  a 
promising  stardng  point  for  invesdgadng  cultural  traits  since  frequent  opportunities  for 
social  behavioral  interaction  and  transmission  are  available  between  group  members. 

A  lively  debate  persists  over  the  presence  and  nature  of  culture  among  cetaceans 
more  generally  (see  the  review  by  Rendell  &  Whitehead,  2001  and  the  ensuing  commentary). 
Killer  whales  in  particular  are  excellent  candidates  for  investigating  the  areas  of  animal 
culture  and  social  learning.  They  are  characterized  by  stable  social  groups,  population- 
specific  foraging  strategies  and  vocal  repertoires  that  are  likely  to  be  transmitted  through 
learning.  Their  feeding  behaviors  include  hunting  fishes,  cephalopods,  sea  turtles,  sea  birds, 
mustelids,  pinnipeds  and  cetaceans  (e.g.,  Martinez  &  Klinghammer,  1970;  Christensen,  1978; 
Smith  et  al.,  1981;  Hoelzel,  1991;  Jefferson  et  al.,  1991;  Baird  et  ah,  1992;  Simila  &  Ugarte, 
1993;  Matkin  &  Saulitis,  1994;  Baird  &  Dill,  1995;  Baird  &  Dill,  1996;  Fertl  et  al.,  1996; 

Simila  et  ah,  1996;  Ford  et  al.,  1998;  Baird  &  Whitehead,  2000;  Ford  &  Ellis,  2006;  Simon  et 
ah,  2006).  The  diversity  of  hunting  strategies  parallels  the  diversity  of  prey  on  which  they 
feed.  Though  sufficient  ecological  differences  in  habitat  and  food  sources  exist  between 
many  of  these  populations  to  explain  the  variety  of  foraging  strategies,  the  adaptability  and 
flexibility  of  this  species  are  evident. 

Two  sympatric  populations  in  the  Pacific  Northwest  have  partitioned  their  niche 
according  to  a  diet  consisting  exclusively  of  salmon  or  marine  mammals  (Ford  et  ah,  1998; 
Saulitis  et  al.,  2000).  Another  example  of  social  transmission  of  foraging  behavior  concerns 
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Guinet  &  Bouvier’s  (1995)  report  of  adult  killer  whales  in  the  Crozet  Archipelago  teaching 
their  calves  how  to  beach  themselves  and  capture  pinniped  prey.  This  observation  has  not 
been  subjected  to  proper  experimental  scrutiny,  however,  and  remains  speculative. 
Longitudinal  data  collection  across  multiple  calves  and  juveniles  is  required  to  demonstrate 
teaching  in  this  population.  The  absence  of  both  regional  and  global  genetic  variation,  the 
former  likely  due  to  matrilineal  group  structure  and  the  latter  suggestive  of  an  earlier 
bottleneck  event  (Hoelzel  et  al.,  2002),  suggests  that  the  variability  in  foraging  behaviors  may 
have  been  more  likely  to  arise  from  learning  and/or  ecological  differences. 

Compared  to  the  fission-fusion  societies  of  bottlenose  dolphins  described  above, 
killer  whale  social  groupings  are  remarkably  stable,  generating  a  set  of  long-lasting 
relationships  between  a  set  of  animals  that  interact  reliably  with  known  conspecifics.  This 
feature  of  killer  whale  social  behavior,  combined  with  an  aptitude  for  learning,  create 
conditions  in  which  cultural  transmission  could  occur.  Among  fish-eating  resident  orcas,  the 
matriline  is  the  fundamental  social  unit  from  which  neither  male  nor  female  offspring 
emigrate  even  after  achieving  sexual  maturity  (Bigg  et  al.,  1990).  (Unless  otherwise  stated, 
the  research  discussed  here  comes  from  the  large  body  of  work  conducted  on  the  population 
and  vocal  dynamics  governing  the  killer  whale  communities  of  the  Pacific  Northwest.) 
Matrilines  that  associate  over  half  of  the  time  (based  on  visual  observations  at  the  surface) 
are  considered  to  belong  to  the  same  pod  (Bigg  et  al.,  1990)  and  matrilines  and  pods  sharing 
elements  of  their  acoustic  repertoire  are  referred  to  as  clans  (Ford,  1991;  Yurk  et  al.,  2002). 
Finally,  clans  that  associate  and  interact  regularly  are  considered  part  of  the  same  community’ 
even  if  their  association  measures  less  than  half  (Bigg  et  al.,  1990).  This  stable,  long-lasting 
and  gregarious  social  context  allows  these  animals  to  observe  and  learn  from  one  another,  a 
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scenario  supported  by  their  vocal  behavior. 

The  members  of  each  matriline  produce  a  set  of  specific  call  types,  which  serve  as 
group-distinctive  vocal  signatures  (Ford,  1991;  Miller  &  Bain,  2000).  Resident  pods  possess 
unique  vocal  repertoires  of  stereotyped  pulsed  calls  with  primary  energy  between  1  and  6 
kHz  (Ford,  1989).  These  calls  contain  both  a  high-frequency  component  (HFC  between  2 
and  12  kHz,  Hoelzel  &  Osborne,  1986)  beamed  forward  from  the  melon  and  a  less 
directional  low  frequency  component  (LFC  between  80  and  2400  Hz,  Ford,  1987).  The 
relative  energy  of  these  two  components  may  cue  conspecifics  into  the  orientation  of  the 
signaler  (Miller,  2002).  Discrete,  stereotyped  calls  constitute  the  majority  of  vocalizations  in 
most  behavioral  contexts  (Ford,  1989).  Hitherto,  the  individual  call  type  has  largely  been 
regarded  as  an  arbitrary  and  interchangeable  unit  without  any  specific  behavioral 
significance.  Certain  call  types  may  be  more  common  in  certain  contexts  (see  Deecke  et  al., 
2005;  Van  Opzeeland  et  al.,  2005)  but  in  general,  it  has  been  assumed  that  the  full  vocal 
repertoire  is  used  by  social  groups  to  differentiate  one  another.  If  this  is  the  sole  function, 
however,  it  is  not  clear  why  so  many  call  types  are  required.  It  is  possible  that  the  functions 
of  these  call  types  may  depend  on  a  behavioral,  social  or  interactive  context  that  we  have  not 
yet  been  able  to  discern  adequately. 

Although  no  definitive  study  has  been  conducted  demonstrating  vocal  learning  in 
killer  whales,  several  separate  observations  support  both  this  conclusion  (Bowles  et  al.,  1988; 
Foote  et  al.,  2006;  Nousek  et  al.,  2006;  Riesch  et  al.,  2006)  and  that  cultural  transmission  may 
be  the  mechanism  underlying  development  and  acquisition  of  the  vocal  repertoire.  Ford 
(1991)  suggested  that  when  pods  become  too  large  to  afford  all  members  adequate  access  to 
resources,  they  splinter  into  two  or  more  smaller  pods  and  acoustic  dialects  begin  to  diverge 
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slowly  through  cultural  drift  whereby  changes  arise  from  small  copying  errors  in  vocal 
imitation  and  transmission.  Over  time,  it  is  believed  that  this  gradual  divergence  has  led  to 
the  development  of  unique  repertoires  between  pods  and  changes  in  the  reladve  numbers  of 
call  types  within  pods.  A  comparison  of  two  stereotyped  calls  over  12-13  years  revealed  that 
different  call  types  have  undergone  structural  modificadons  at  different  rates,  likely  arising 
from  cultural  drift  (Deecke  et  ah,  2000). 

Undl  recendy,  analysis  of  the  social  and  vocal  behavior  of  free-ranging  killer  whales 
was  conducted  at  the  group  level  because  it  was  not  possible  to  monitor  individual  animals 
condnuously  in  dme  once  they  left  the  surface.  Recent  advances  in  recording  technology 
including  towed  beamforming  arrays  (Miller  &  Tyack,  1998)  and  digital  archival  tags 
(Johnson  &  Tyack,  2003)  are  allowing  more  detailed  invesdgauons  of  individual  behavior. 

For  example.  Ford  (1989)  described  that  calls  of  the  same  type  tended  to  follow  one  another 
but  it  was  unclear  whether  a  single  individual  produced  them  in  a  series  or  multiple  ^ 

individuals  were  exchanging  these  calls.  Using  a  towed  array  to  isolate  the  calling  behavior 
of  a  single  individual  within  a  group,  Miller  et  al.  (2004c)  ascertained  that  this  matched 
counter-calling  behavior  was  due  to  vocal  exchanges  between  individuals.  In  demonstrating 
fine-scale  vocal  interactions,  this  study  suggested  that  the  contextual  and  vocal  learning  of 
stereotyped  calls  may  be  reinforced  by  matched  counter-calling,  which  could  play  a  role  in 
coordinating  group  travel  or  maintaining  group  cohesion. 

1.3  Introduction  to  dissertation  research 

The  observational  support  for  social  learning  and  cultural  transmission  in  killer 
whales  is  certainly  suggestive,  though  it  has  not  been  demonstrated  explicidy.  The  diverse 
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array  of  feeding  strategies  speaks  to  the  behavioral  versatility  and  environmental  adaptability 
of  the  species,  features  that  tend  to  be  associated  with  learning.  This  dissertation  was 
motivated  by  an  interest  in  understanding  how  the  behavior  of  individual  killer  whales 
contributed  to  these  group  foraging  behaviors.  Their  stable  family  groupings  provide  a 
network  of  reliable  and  regular  interactions  through  which  behavior  could  propagate 
culturally,  a  point  reinforced  by  the  pod-specific  vocal  repertoires.  My  dissertation  also 
sought  to  explore  both  the  relationships  between  the  vocal  and  non-vocal  activity  patterns  of 
these  animals  and  the  possible  components  of  their  vocalizations  that  might  be  learned  and 
subsequently  concatenated  to  form  the  call  types  of  the  repertoire. 

This  study  benefited  from  selecting  as  its  study  species  the  killer  whales  that  inhabit  the 
fjords  of  northern  Norway  (Figure  1.1)  in  the  wintertime  as  they  follow  spring-spawning 
herring  ( Clupea  harengus)  stocks  (Simila  et  al.,  1996).  My  research  took  advantage  of  a  few 
long-term  studies  of  the  Norwegian  killer  whales,  including  a  catalog  of  almost  600  identified 
animals,  records  of  pod  composition  and  detailed  descriptions  of  carousel  feeding,  a  group 
foraging  behavior  involving  corralling  and  incapacitating  herring.  In  particular,  groups  of 
Norwegian  (and  Icelandic  as  documented  by  Simon  et  al.,  2005)  killer  whales  corral  herring 
by  circling  the  fish  to  trigger  their  coalescing,  ultimately  lunging  towards  and  tail-slapping  the 
prey  ball  to  incapacitate  the  fish  before  feeding  (Christensen,  1978;  Simila  &  Ugarte,  1993; 
Simon  et  al.,  2005).  The  primary  objectives  of  my  dissertation  were  to  examine  the 
movement  behaviors  and  association  patterns  of  individual  free-ranging  Norwegian  killer 
whales,  whether  their  movement  and  vocal  behavior  related  to  one  another  during  both 
foraging  and  non-foraging  contexts,  and  whether  call  subunits  were  assembled  syntactically 
into  discrete  calls  and  call  sequences.  In  Chapter  2, 1  investigated  how  individual  animals 
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Figure  1.1.  Map  of  field  site.  Top:  Scandinavia  with  box  corresponding  to  the  zoomed  in  plot  on  the  bottom. 
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Most  of  the  tagouts  were  conducted  in  Tysfjord  though  a  small  number  were  completed  in  Vestfjord. 
contributed  within  their  group  to  execute  their  foraging  sequences,  building  as  detailed  a 

spatial  and  temporal  picture  as  possible  of  the  carousel  feeding  behavior.  Chapter  3 
considered  the  different  association  patterns  of  simultaneously  tagged  killer  whales  during 
traveling  versus  carousel  feeding  periods. 

The  killer  whales  of  northern  Norway  produce  repertoires  of  3-16  call  types  (Strager, 
1993;  1995).  Call  subtypes  corresponded  to  calls  or  call  components  whose  duration  or 
repetition  rate  was  altered  significantly  or  to  calls  in  which  at  least  one  component  was  added 
or  removed.  Strager  (1995)  additionally  described  compound  calls,  which  either  contained 
two  or  more  calls  that  were  also  produced  individually  as  discrete  calls  or  included  a 
component  that  was  produced  within  multiple  discrete  call  types.  These  call  components 
underwent  significant  duration  and  repetition  rate  modifications  when  combined  in  tandem 
with  other  calls. 

The  behavioral  context  associated  with  the  acoustic  activity  of  an  animal  is  important 
when  assessing  the  social,  communicative,  foraging  or  other  functions  of  a  vocal  repertoire. 
Only  one  call  type  showed  different  rates  of  production  between  the  two  behavioral 
categories  of  seiner  and  carousel  feeding  (Van  Opzeeland  et  al.,  2005).  Otherwise,  call  types 
have  not  been  associated  with  particular  behaviors.  I  examined  the  potential  relationships 
between  the  movement  and  vocal  behaviors  of  these  animals  in  Chapter  4,  which  are 
valuable  for  understanding  the  function  of  vocal  behavior  and  whether  and  why  animals 
deploy  different  signals  from  their  repertoire  in  different  contexts.  Finally,  in  Chapter  5, 1 
used  techniques  adapted  from  human  speech  processing  to  test  whether  stereotyped 
Norwegian  calls  could  be  represented  by  a  set  of  flexibly  arranged  and  smaller  phonemic 
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segments,  exploring  the  possibility  of  vocal  syntax  in  these  animals.  I  offered  my 
conclusions  in  Chapter  6. 

Appendices  1  -  3  include  additional  data  plots  that  are  referenced  by  the  dissertation 
data  chapters.  Appendices  4  —  6  contain  either  published  or  submitted  first  author 
manuscripts  that  pertain  to  different  subjects  than  the  primary  narrative  arc  of  culture  in 
killer  whales.  Appendix  4  is  a  manuscript  on  the  signature  content  of  two  vocalization 
classes,  combined  tonal/pulsed  signals  and  whistles,  produced  by  two  adult  male  narwhals 
( Monodon  monoceros)  tagged  off  the  shores  of  Admiralty  Inlet  on  Baffin  Island,  Canada.  It  was 
published  in  the  journal  of  the  Acoustical  Society  of  America  in  September  2006.  In  Appendix  5,  1 
explored  whether  sperm  whales  tagged  in  the  Ligurian  Sea  displayed  any  consistent  angular 
response  relative  to  a  source  vessel  producing  mid-frequency  sonar  pings  as  a  function  of 
sound  exposure  level.  This  document  was  co-authored  by  Peter  Tvack  and  Andrew  Solow 
and  we  are  revising  it  based  on  reviewer  comments  before  resubmitting  to  Marine  Mammal 
Science  for  publication.  Appendix  6  reports  on  the  high  variability’  of  sound  exposure  levels 
as  a  function  of  range  for  several  kinds  of  acoustic  harassment  and  deterrent  devices  used  to 
discourage  marine  mammals  from  feeding  on  stock  caught  by  fisheries.  It  explored  the 
consequences  of  an  animal  attempting  to  minimize  its  exposure  upon  encountering  a 
complicated  3D  field  of  levels.  Originating  as  a  class  project  for  a  summer  bioacoustics 
course  that  1  attended  in  Tjarno,  Sweden  in  2005, 1  have  submitted  this  manuscript  with  8 
co-authors  to  Marine  Mammal  Science. 

1.3.1  Field  procedure:  Daily  sailing  routine  and  tagging  procedure 

To  collect  the  continuous  movement  and  acoustic  data  required,  digital  archival  tags 
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(DTAGs,  Johnson  &  Tyack,  2003)  were  attached  to  individual  free-ranging  killer  whales 


Figure  1.2.  DTAG  on  killer  whale  from  the  2006  field  season.  Photograph  courtesy  of  Cathy  Harlow. 


(Figure  1.2).  All  field  work  was  conducted  during  November  2005  and  2006.  Tagged 
animals  were  visually  tracked  and  their  range,  bearing  and  behavior  were  recorded  for 
subsequent  ground  truthing  with  the  tag  data  and  for  monitoring  the  behavioral  context  of 
both  the  tagged  animal’s  group  and  other  neighboring  groups.  Two  vessels  were  used  in 
both  years:  a  12m  sailboat  lolaire  observation  platform  and  a  small  RH1B  from  which  we 
deployed  and  recovered  the  tags.  In  2006,  we  benefited  from  an  additional  set  of  platforms: 
the  research  vessels  Sverdrup  and  Nekken  and  two  RHIBs.  The  lolaire  or  Nekken  followed  the 
first  tagged  animal  (defined  as  the  focal)  and  its  group,  towing  the  hydrophone  array  at  a 
relatively  constant  range  of  ~100m  from  the  animals  until  the  tag  released,  allowing 
reasonably  close  visual  inspection  without  excessive  maneuvering  near  the  animals.  Focal 
follows  were  necessary  for  both  tag  recovery  purposes  and  behavioral  tracking  of  the  tagged 
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animal’s  group.  Once  the  tags  released  and  were  recovered,  data  were  offloaded  ashore  and 
the  tag  was  recharged  and  sterilized  for  subsequent  use.  The  effort  was  considered  complete 
once  all  of  the  tags  were  recovered  and  there  was  no  longer  sufficient  daylight  to  attempt 
further  tagging.  Because  it  was  difficult  to  follow  animals  in  bad  weather  against  the  wind 
and  the  waves,  tagging  was  not  attempted  when  the  weather  forecast  was  poor  and  aborted 
if  conditions  grew  too  severe  or  dangerous  while  at  sea. 

Across  both  years,  we  achieved  15  tag  carries  ranging  between  1.2  and  5.5  hours 
combined  with  visual  tracking  and  frequent  photo-identification  including  three  pairs  of 
simultaneous  tag  deployments,  two  of  which  were  on  animals  belonging  to  the  same  social 
group.  Endeavoring  to  tag  more  than  one  whale  in  a  group  was  a  high  science  priority 
because  additional  recorders  allowed  an  examination  of  association  and  synchronv  of  group 
members  during  carousel  feeding  and  other  behaviors.  This  data  collection  scenario  would 
help  reveal  whether  individuals  engage  in  same  or  different  activities  during  a  particular 
behavioral  episode,  providing  preliminary7  observations  of  what  might  be  considered 
coordination  or  role  playing.  Animals  were  tagged  during  both  foraging  and  non-foraging 
contexts.  Three  tag  carries  in  2006  occurred  during  a  behavioral  response  stuy  of  the  effects 
of  sonar.  Only  the  pre-exposure  periods  from  these  recordings  were  considered  here, 
reducing  the  dataset  to  14  tag  carries  lasting  between  0.6  and  4.7  hours  (Table  1.1).  In  total, 
32.2  hours  of  movement  data  and  33.3  hours  of  vocal  data  were  analyzed.  (A  low  battery' 
contributed  to  sensor  calibration  difficulties  for  one  of  the  deployments,  causing  the 
discrepancy  in  the  durations  of  the  movement  and  vocal  data.)  The  hydrophone  array  data 
were  not  analyzed  here. 
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animal  id 

duration  (h) 

notes 

oo05 316a 

2.7 

oo05 320a 

4.7 

Synchronous  tagout  of  two  animals 

oo05 320b 

2.2 

in  different  social  groups 

oo05 321a 

4.2 

Synchronous  tagout  of  two  animals 

oo05 321b 

1.7 

in  the  same  social  group 

oo05 322a 

3.6 

Synchronous  tagout  of  two  animals 

oo05 322b 

3.1 

in  the  same  social  group 

oo05 324a 

2.2 

oo06 313s 

2.2 

oo06 314a 

2.1 

oo06 314s 

2.1 

oo06 317s 

0.6 

oo06_324s 

1.2 

Unable  to  calibrate  all  of  the  movement  sensors 

because  of  low  battery  during  deployment 

oo06 327s 

0.9 

total 

33.3 

Table  1.1.  Tagged  killer  whale  subjects  and  durations  analyzed  in  this  dissertadon. 


1.3.2  Focal  follow  procedure 

The  overall  goal  of  the  focal  follow  was  to  provide  information  on  group  behavior  as 
context  for  the  movement  and  acoustic  data  streams.  At  regular  intervals  once  every  two 
minutes,  data  were  recorded  on  the  dynamics  of  the  focal  group.  This  information  was 
useful  as  a  cross  reference  to  identify  the  time  periods  when  the  animals  were  foraging  and 
paralleled  the  behavioral  state  information  acquired  in  earlier  studies.  In  particular,  milling 
index,  group  formation,  group  size  and  the  presence  of  gulls  interacting  with  the  water’s 
surface  were  noted.  Changes  in  the  milling  index,  direction  of  travel  and  group  composition 
were  recorded  opportunistically,  generally  at  intervals  lasting  longer  than  two  minutes.  One 
of  the  three  simultaneous  tag  deployments  involved  two  animals  from  different  social 
groups.  In  this  instance,  once  the  tag  released  from  the  first  focal  individual,  behavioral 
observations  ended  with  its  group  and  began  with  the  group  containing  the  second  focal 
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animal.  In  the  other  two  instances  of  simultaneous  tagging,  the  two  animals  belonged  to  the 
same  social  group  on  which  behavioral  observations  were  recorded  for  the  duration  of  the 
two  tag  deployments. 

1.4  Summary 

Killer  whale  populations  around  the  world  are  characterized  by  distinctive  foraging 
behaviors  and  vocal  repertoires  of  call  types  that  are  likely  socially  learned  through  cultural 
transmission.  In  my  dissertation,  I  use  digital  archival  tag  data  to  explore  the  movement  and 
carousel  feeding  behaviors  of  free-ranging  individual  Norwegian  killer  whales  and  the 
detailed  vocal  behavior  and  syntax  of  their  groups.  I  conclude  by  discussing  how  these 
results  impact  future  work,  which  should  continue  to  probe  the  nexus  of  social  behavior  and 
learning. 
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Chapter  2.  Individual  Behaviors  of  Carousel  Feeding 
Norwegian  Killer  Whales 

2.1  Abstract 

Among  marine  mammals,  group  foraging  that  facilitates  consumption  of  evasive  or 
large  prey  has  generally  remained  unexplored  from  the  perspective  of  the  individual 
predators.  Norwegian  killer  whales  ( Orcinus  orca)  herd  herring  from  depth  into  balls  at  the 
surface,  incapacitate  them  with  tail  slaps  and  feed  on  the  stunned  fish  one  by  one  in  a 
behavior  called  carousel  feeding.  I  explored  carousel  feeding  at  the  level  of  the  individual 
killer  whale  by  analyzing  data  of  digital  archival  tags  that  recorded  each  animal’s  depth, 
orientation  and  acoustic  environment.  Bouts  of  carousel  feeding  were  defined  based  on 
acoustic  evidence  of  tail  slaps.  Measures  of  changes  in  orientation  and  depth,  fluking  energy 
and  variation  in  heading  all  showed  distinct  increases  during  carousel  feeding  compared  to 
other  time  periods.  Using  a  rapid  change  in  pitch  angle  to  determine  which  tail  slaps  were 
produced  by  the  tagged  animal,  1  found  that  most  tail  slaps  occurred  at  shallow  depths 
within  20m  of  the  surface.  Two  synchronously  tagged  animals  revealed  similar  dive  profiles 
when  tail  slapping  within  60s  of  one  another,  suggesting  that  the  location  and  geometry  of 
the  herring  ball  was  influencing  their  tail  slapping  movements.  A  linearity7  index 
measurement  was  used  to  split  the  dataset  into  periods  of  high  and  low  circuitousness;  two 
behavior  patterns  preceding  carousel  feeding  were  evident.  In  the  first,  killer  whales  initiated 
tail  slapping  behavior  after  a  period  of  directional  swimming,  suggesting  an  absence  of 
corralling  at  depth.  Similar  to  observations  reported  by  earlier  studies,  the  second  sequence 
involved  killer  whales  engaging  in  broad  looping  movements  consistent  with  corralling 
before  commencing  their  tail  slapping  activity.  Together,  these  results  indicate  that  carousel 


43 


feeding  is  fluid  and  opportunistic.  Individual  animals  maneuver  within  the  group  to  collect 
and  herd  herring  either  already  at  the  surface  or  originating  at  depth  and  they  maintain  the 
geometry  of  the  fish  ball  as  long  as  possible  during  feeding. 

2.2  Introduction 

The  extent  to  which  animals  forage  on  their  own  or  in  groups  depends  on  the 
distribution  of  food  resources,  the  ease  and  relative  payoffs  of  locating,  grazing,  or  hunting 
food  independendy  or  collectively,  and  the  nature  and  dynamics  of  the  social  interactions  of 
the  population.  Multiple  individuals  of  a  species  may  aggregate  at  a  plentiful  food  patch 
without  interacting  or  they  may  engage  with  one  another  through  competition  or 
cooperation.  Competition  arises  when  animals  exploit  or  monopolize  a  food  source  at  the 
expense  of  others.  Cooperative  foraging,  the  focus  of  this  chapter  involves  individuals  that 
work  together  to  locate,  incapacitate,  handle,  and/or  feed  on  prey.  Kin  selection  rewards 
cooperation  among  related  individuals  by  enhancing  the  inclusive  fitness  of  the  participants 
(e.g.,  lions  (Panthera  leo).  Packer  et  al.,  1990).  Cooperation  can  also  arise  under  reciprocal 
altruism,  the  serial  exchange  of  beneficial  behavior  between  related  or  non-related 
individuals  (e.g.,  vampire  bats  ( Desmodus  rotundus),  Wilkinson,  1984;  dolphins,  Connor  & 
Norris,  1982).  In  other  cases  of  social  foraging,  the  nutritional  benefit:cost  ratio  of  feeding 
together  must  exceed  that  associated  with  resource  competition  and  independent  hunting 
(Macdonald,  1983;  Nudds,  1978;  Clark,  1986;  Packer  &  Ruttan,  1988).  Modeling  suggests 
that  cooperative  foraging  behaviors  are  likely  to  persist  when  hunting  success,  defined  in 
terms  of  either  the  number  or  size  of  the  prey  being  sought,  improves  in  groups  versus  when 
alone,  a  pattern  borne  out  by  wild  Tai  chimpanzees  (Pan  troglodytes-.  Boesch,  1994),  captive 
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black-headed  gulls  (Icarus  ridibundur.  Gotmark  et  al.,  1986)  and  mammal-eating  transient  killer 
whales  ( Orcinus  orca :  Baird  &  Dill,  1996). 

Cooperative  activity  can  enhance  foraging  efforts  by  1)  making  the  search  phase  for 
food  more  efficient,  2)  allowing  animals  to  capture  prey  that  would  be  difficult  to  secure 
alone,  and  3)  introducing  a  division  of  labor  in  which  individuals  specialize  repeatedly  on 
different  tasks  within  the  group  (though  specialization  can  be  inefficient  if  animals  are 
unable  to  behave  flexibly).  1  will  consider  each  of  these  three  points  in  turn.  First,  foraging 
in  groups  often  arises  to  optimize  the  search,  especially  when  food  patches  are  evanescent 
and  large  but  challenging  to  locate,  rendering  them  difficult  to  monopolize  via  territoriality 
(Dittus,  1984;  Elgar,  1986;  Brown  et  al.,  1991;  Wilkinson,  1992;  Cocroft,  2005).  Second,  the 
simultaneous  (yet  possibly  dispersed)  pursuit  of  prey  allows  predators  to  develop 
countermeasures  against  some  defenses  of  their  quarry7  (e.g.,  Creel  &  Creel,  1995).  Multiple 
pairs  of  spinner  dolphins  ( Stenella  longirostris)  near  Hawai’i,  for  example,  overcame  the  evasive 
and  dynamic  schooling  tendencies  of  their  micronekton  prey  by  echolocating  to  track  the 
horizontal  and  vertical  excursions  of  these  fishes,  shrimps  and  squid  and  herding  them  into  a 
more  accommodating  three  dimensional  (3D)  geometry7  for  capture  (Benoit-Bird  &  Au, 
2003).  One  of  the  critical  features  of  this  study  involved  the  ability  to  map  the  distribution 
of  both  dolphins  and  prey  simultaneously.  The  dolphins  clearly  demonstrated  well 
coordinated  behavior.  Although  the  patterns  of  prey  response  and  the  structured  formation 
of  these  dolphin  pairs  were  taken  as  evidence  for  cooperative  foraging,  the  methods  were 
unable  to  track  individuals  through  time  or  detect  the  fine-scale  movements  of  either 
predator  or  prey.  To  demonstrate  cooperation,  the  following  additional  observations  would 
have  been  required:  each  animal  involved  was  participating  in  the  effort  either  consistendv  or 
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via  staggered  turn  taking,  all  individuals  received  an  opportunity  to  eat  and  the  dolphin 
group  responded  globally  to  alteration  in  3D  prey  structure. 

Third,  under  certain  circumstances,  cooperative  foraging  allows  role  specialization  in 
which  individual  animals  each  repeatedly  conduct  a  particular  task  within  the  collective 
effort.  Stander  (1992)  distinguished  between  Namibian  lioness  “wings”  that  initiated  hunts 
by  stalking  and  circling  their  prey  and  lioness  “centers”  that  captured  the  prey  once  it  was 
driven  towards  them.  Similarly,  among  two  bottlenose  dolphin  ( Tursiops  truncatus)  groups  in 
the  Cedar  Keys,  Florida,  one  animal  repeatedly  assumed  the  role  of  the  “driver”  that  steered 
the  fish  towards  2-5  closely  spaced  “barrier”  animals  (Gazda  et  al.,  2005).  In  another 
example,  once  groups  of  Dusky  dolphins  (Lagenorhynchus  obscurus)  corralled  southern 
anchovies  ( Engraulis  anchoita)  into  a  ball  at  the  surface,  individual  animals  took  turns  to  break 
rank  and  swim  through  the  ball  to  eat  a  mouthful  of  fish  before  rejoining  the  group 
corralling  effort  (Wiirsig  &  Wiirsig,  1980;  Wiirsig,  1986).  Larger  groups  fed  more  of  the  time 
and  for  longer  periods  than  smaller  groups.  Finally,  female  humpback  whales  ( Megaptera 
novaeangliae)  in  Alaska  formed  a  tight  spatial  configuration  during  bubble  net  feeding  on 
euphausiids  and  vertical  lunge-feeding  on  herring  ( Clupea  harengus)  in  a  manner  suggestive  of 
a  division  of  labor  (D’Vincent  et  al.,  1985).  A  call  was  produced  that  preceded  feeding 
sequences  in  which  a  ring  of  whales  assumed  individually-consistent  positions  and  ascended 
to  enclose  and  trap  their  prey  against  the  air-water  interface.  Role  specialization  can  be 
demonstrated  when  different  functional  behaviors  and  spatial  configurations  are  executed 
consistently  by  the  same  animals  within  a  group. 

Killer  whales  display  a  striking  array  of  population-specific  feeding  strategies,  some 
of  which  show  evidence  for  collective  foraging  and  food  provisioning  (e.g.,  Smith  et  ah, 
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1981;  Hoelzel,  1991;  Baird  &  Dill,  1995;  Ford  &  Ellis,  2006).  Transient  killer  whales  in 
British  Columbia,  for  example,  often  form  groups  of  three  animals  to  fetch  the  most  amount 
of  meat  for  their  combined  effort  (Baird  &  Dill,  1996;  Baird  &  Whitehead,  2000).  This 
optimal  group  size  allows  them  to  locate  and  hunt  their  marine  mammal  prey  without  being 
easily  detected.  Fish-eating  resident  killer  whales  travel  in  much  larger  groups,  however,  and 
do  not  share  food  or  depend  on  one  another  to  catch  prey  that  is  easily  hunted  and  handled 
individually  (see  Baird  et  al.,  1992;  Baird  &  Whitehead,  2000).  Historically,  the  behavioral 
states  that  have  captured  these  dynamics  (e.g.,  feeding,  traveling,  group  size  and 
composition)  have  been  defined  based  on  surface  observations  of  killer  whale  group  activity 
(Bain,  1986;  Morton  et  al.,  1986).  This  group  follow  approach  (see  Mann,  2000;  Whitehead, 
2004),  however,  has  not  considered  underwater  or  individual  activity'.  To  understand  the 
patterns  of  group  foraging  in  3D,  it  is  crucial  to  examine  the  continuous  behavior  of  the 
individual  killer  whales  within  these  groups.  This  will  both  provide  information  on  how 
actively  each  animal  participates  in  the  feeding  effort  and  verify  that  all  group  members  are 
indeed  afforded  the  chance  to  eat  (except  for  younger  animals  that  feed  by  suckling  milk),  as 
long  as  feeding  corresponds  with  a  noticeable  3D  orientation  signal. 

Norwegian  killer  whales  partake  in  a  group  foraging  sequence  called  carousel  feeding 
in  which  animals  feed  on  schools  of  herring,  apparently  herding  the  fish  from  depth, 
corralling  them  into  a  tight  ball  against  the  surface  of  the  water  and  tail  slapping  the  edge  of 
the  ball  to  incapacitate  the  fish  and  eat  them  one  at  a  time  (Christensen,  1978;  Simila  & 
Ugarte,  1993).  Earlier  work  has  suggested,  but  not  demonstrated  rigorously,  that  these 
animals  are  interacting  cooperatively.  It  is  likely  that  such  a  group  strategy  boosts  the 
benefiticost  ratio  for  the  individuals  participating  by  improving  the  net  caloric  intake  and 
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managing  an  otherwise  evasive  prey  species.  This  behavior  could  be  motivated  by  kin 
selection  (since  they  live  in  family  groups)  or  reciprocal  altruism  (since  group  composition  is 
stable).  Diving  up  to  160-180  m  to  locate  herring  patches,  different  groups  of  killer  whales 
converge  first  to  herd  a  large  swath  of  herring  away  from  the  total  aggregation  and  then 
fragment  it  into  smaller,  more  manageable  schools  as  they  drive  the  fish  to  shoal  (Nottestad 
&  Simila,  2001;  Nottestad  et  al.,  2002).  The  sonar  technology  used  by  these  earlier  studies  to 
describe  the  corralling  behavior  at  depth  documented  the  generic  movements  of  all  animals 
within  a  group  simultaneously  without  tracking  individual  whales  continuously  through  time. 

Once  the  herring  have  been  gathered  at  the  surface,  the  orcas  further  corral  them 
into  a  small,  compact  and  circular  ball  by  swimming  around  the  fish,  flashing  their  white 
ventral  sides  and  releasing  bubbles  (Simila  &  Ugarte,  1993;  Nottestad  &  Axelsen,  1999). 

They  capitalize  on  some  of  the  same  herring  responses  that  the  lunge-feeding  humpback 
whales  exploit  (D’Vincent  et  al.,  1985;  Sharpe  &  Dill,  1997).  Individual  killer  whales 
sometimes  lunge  into  the  school,  presumably  to  steer  the  fish  since  no  feeding  was  observed 
at  this  time  (Domenici  et  al.,  2000).  The  animals  tail  slap  the  edge  of  the  ball  and  stun 
herring  likely  through  cavitation  or  direct  contact,  some  of  which  are  then  consumed  (Simon 
et  al.,  2005).  Killer  whale  tail  slaps  can  exceed  herring  escape  velocities  and  accelerations 
(Domenici  et  al.,  2000;  Domenici,  2001).  The  absence  of  direct  feeding  by  engulfing 
mouthfuls  of  fish  suggests  an  alternative  strategy  to  the  Dusky  dolphins  mentioned  earlier. 
Here,  the  killer  whales  maintain  the  geometry  and  density  of  the  ball,  perhaps  to  prevent 
herring  from  escaping  laterally  or  vertically.  Differences  in  the  response  behavior  of  the 
prey,  the  compactness  of  the  fish  ball  and  the  size  of  the  predators  may  contribute  to  the 
distinct  feeding  strategies  pursued  by  these  two  cetacean  populations.  Carousel  feeding  at 
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the  surface  both  pushes  the  herring  against  an  air-water  barrier  and  decreases  the  vertical 
distance  that  the  orcas  must  traverse  to  take  a  breath.  Carousel  feeding  allows  the  killer 
whales  to  successfully  manage  the  herring’s  deep  vertical  migradon,  schooling  behavior  and 
enhanced  maneuverability. 

Much  of  the  previous  research  has  explored  the  overall  carousel  feeding  sequence 
from  the  level  of  the  group.  Although  the  work  of  Domenici  et  al.  (2000)  quantified 
movement  features  of  individuals,  they  averaged  the  observed  behaviors  over  all  group 
members  because  they  were  unable  to  track  the  same  animals  continuouslv  once  they  moved 
out  of  view  on  the  video  recording.  Observing  the  behavior  of  an  individual  participating  in 
cooperative  foraging  has  been  more  straightforward  for  terrestrial  than  marine  animals  but 
this  kind  of  approach  is  essential  for  exploring  individual-  versus  group-specific  behaviors 
and  the  possibilities  of  role  playing  and  turn  taking  (i.e.,  changing  subsets  of  killer  whales 
corralling  and  tail  slapping  the  fish  over  the  course  of  a  single  carousel  feeding  episode). 

Here,  1  analyze  and  present  unbiased  and  continuous  data  gathered  from  digital 
archival  movement  and  audio  tags  deployed  on  free-ranging  Norwegian  killer  whales.  1 
explore  the  behaviors  of  individual  killer  whales  operating  within  their  larger  foraging 
groups.  In  addition,  1  describe  the  detailed  spatial  and  temporal  patterns  of  the  horizontal 
and  vertical  movements  of  carousel  feeding  animals  and  examine  the  consequent 
implications  of  individual  variability  and  role  playing.  After  outlining  the  general  methods  in 
the  next  section,  I  will  present  3  analyses.  The  first  broadlv  characterizes  the  differences  in 
the  values  and  variability  of  the  movements  and  depth  of  the  animals  between  periods 
involving  tail  slapping  and  periods  that  do  not.  Second,  I  examine  tail  slaps  produced  by  the 
tagged  animal  more  closely,  developing  an  approach  to  distinguish  them  from  those 
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produced  by  other  group  members  and  exploring  the  diving  patterns  associated  with  this 
movement  signature.  The  third  analysis  considers  these  two  sets  of  results  together  with  a 
linearity  index  of  movement  to  explore  the  behavioral  sequences  preceding  successful 
carousel  feeding  episodes. 

2.3  General  materials  and  methods 

I  conducted  field  work  in  Tysfjord  and  Vestfjord,  Norway,  in  the  fjord  system  just 
south  of  the  Lofoten  islands  (~68°15’  N,  ~16°E).  These  fjords  receive  a  massive  influx  of 
spring-spawning  herring  in  the  winter  time,  attracting  foraging  killer  whales  (Simila  et  al., 
1996).  Digital  archival  tags  (DTAGs:  Johnson  &  Tyack,  2003)  were  attached  from  a  rigid 
hull  inflatable  boat  with  a  7  m  hand  pole  to  8  orcas  in  November  2005  and  to  7  orcas  in 
November  2006  for  between  1.2  and  5.5  hours  each.  Tag  deployment  durations  were 
deliberately  short  because  of  limited  daylight  and  difficult  nighttime  tracking  conditions. 
These  tags  sampled  sound  at  96  kHz  and  movement  at  50  Hz  via  a  tri-axial  accelerometer,  a 
tri-axial  magnetometer  and  a  pressure  sensor  for  depth.  Although  some  of  the  animals  in 
the  2006  dataset  were  introduced  to  sonar  after  a  quiet  pre-exposure  session,  only  time 
periods  before  these  exposures  were  analyzed  here.  In  addition,  the  movement  record  of 
one  animal  (oo06_324s)  could  not  be  calibrated  due  to  a  low  battery.  These  constraints 
reduced  the  dataset  to  13  animals  and  32.3  tag  recording  hours  in  all.  Table  2.1  lists  the  13 
animals  that  were  tagged,  tagout  durations,  sex  and  age  class  when  known,  maximum  dive 
depth  and  data  related  to  tail  slapping  behavior  and  measurements.  Tagged  killer  whales 
were  tracked  using  the  tag’s  VHF  beacon.  The  behavioral  state  of  the  group  was  monitored 
visually  from  aboard  the  sailboat  lolaire  or  the  research  vessel  Sverdrup.  Tags  were 
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programmed  to  release  from  the  animal  at  a  pre-determined  time,  floadng  to  the  surface 
where  they  were  then  located  using  the  VHF  signal.  Data  were  offloaded  in  the  field  and 
saved  to  CD.  All  movement  data  were  subsequendy  calibrated  to  convert  from  the  tag 
frame  to  the  pitch  (-90°  =  pitched  vertically  downwards;  +90°  =  pitched  vertically 
upwards),  roll  (0°  =  dorsal  side  up;  180°  =  ventral  side  up)  and  heading  (compass  bearing)  of 
the  whale  frame  (PRH:  Johnson  &  Tyack,  2003;  Miller  et  al.,  2004b).  The  acoustic  records 
were  audited  manually  by  listening  and  logging  each  acoustic  event. 

2.4  Analysis  1:  General  movement  and  depth  features 

Compared  with  periods  of  travel  or  resting,  killer  whales  that  are  carousel  feeding 
must  maneuver  and  reorient  continuously  to  maintain  the  herring  ball  (Simila  &  Ugarte, 

1993;  Domenici  et  ah,  2000).  In  this  analysis,  a  suite  of  movement  measurements  and  their 
variability  were  broadly  compared  between  periods  of  tail  slapping  (TS),  which  served  as  an 
initial  proxy  for  feeding  activity,  and  periods  of  not  tail  slapping  (NTS). 

2.4.1  Methods 

2.4. 1 . 1  Identifying  tail  slaps  and  tail  slapping  periods 

A  tail  slap  produced  a  characteristic  acoustic  signature  (Simon  et  ah,  2005;  Van 
Opzeeland  et  ah,  2005)  that  could  be  detected  when  listening  through  the  tag  sound 
recordings.  TS  periods  were  defined  as  the  2-min  window  flanking  any  set  of  at  least  3  tail 
slaps  occurring  within  2-min  of  one  another.  These  thresholds  were  chosen  to  decrease  the 
chance  of  including  faint  or  brief  sounds  produced  by  other  percussive  sources  that 
resembled  tail  slapping.  To  ignore  bouts  of  tail  slapping  involving  a  group  composed 


entirely  of  non-tagged  animals  that  were  detected  acoustically  at  a  distance,  TS  periods  were 
additionally  confined  to  bouts  in  which  the  tagged  or  focal  animal  produced  at  least  a  single 
tail  slap.  (These  focal  tail  slaps  were  accompanied  by  a  concomitant  pitch  change  signature. 
See  Analysis  2  for  more  detail.)  The  remaining  sections  of  the  record  were  considered  to  be 
NTS  periods.  Tail  slap  clustering  within  each  bout  was  assessed  by  computing  the 
Greenwood  statistic  on  the  temporal  spacings  between  tail  slap  events  (Greenwood,  1946; 
Stephens,  1986)  and  comparing  it  to  1000  datasets  generated  under  the  null  hypothesis  of  a 
uniform  distribution. 

2 A.  1 .2  Derived  movement  measurements 

To  explore  the  variability  of  the  animals’  movements,  1  computed  the  circular 
standard  deviation  of  the  derivative  of  both  the  pointing  angle  and  roll  measures  and  the 
conventional  standard  deviation  of  the  vertical  velocity  (depth  derivative)  once  per  second. 
The  pointing  angle  collapsed  the  pitch  and  heading  measurements  into  a  single  variable  that 
described  the  3D  orientation  of  the  whale’s  longitudinal  axis  (Miller  et  al.,  2004a).  1  also 
computed  the  residual  heading  of  the  animal  by  first  low-pass  filtering  (to  remove  high 
frequency  fluking  activity  using  a  cut-off  frequency  of  half  of  0.43  Hz;  see  Sato  et  al.,  2007 
for  the  computation  of  the  mean  stroke  cycle  frequency  of  killer  whales)  and  then  high-pass 
filtering  (to  remove  slow  maneuvering  or  trend  of  the  animal  over  a  30s  window)  the 
calibrated  heading  data.  The  fluking  intensity  of  the  animal  was  computed  by  band-pass 
filtering  the  z-axis  of  the  accelerometer  using  frequencies  of  0.3-0.5s,  squaring  the  result  and 
then  taking  the  running  average  (see  Miller  et  al.,  2004b;  Hooker  et  al.,  2005).  Because  the 
measurements  associated  with  focal  tail  slaps  were  considered  explicitly  by  Analysis  2  (see 
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below),  the  data  contained  in  the  time  immediately  spanning  each  focal  tail  slap  were 
removed  before  comparing  the  TS  and  NTS  episodes.  Inter-dive  intervals  (IDIs),  the 
lengths  of  time  spent  at  or  near  the  surface  between  dives,  were  calculated  for  all  dives 
exceeding  10  m  in  depth. 

2.4.2  Results  and  discussion 

Nine  of  the  13  whales  exhibited  both  TS  and  NTS  periods;  the  remainder  contained 
only  NTS  episodes.  None  of  the  maximum  dive  depths  in  Table  2.1  exceeded  the  160-180 
m  depths  observed  by  Nottestad  et  al.  (2002),  suggesting  the  herring  were  located  at 
shallower  depths  here.  The  killer  whales  restricted  98%  and  72%  of  their  overall  diving  to 
within  the  upper  50  m  and  20  m,  respectively  (Figure  2.1).  More  specifically,  they  tended  to 
stay  near  the  surface  regardless  of  their  activity  state,  though  the  deepest  dives  occurred 
during  and  often  just  prior  to  tail  slapping  (TS)  periods  (Figure  2.1,  Appendix  1).  Despite 
animals  spending  comparatively  less  time  in  the  depth  bin  closest  to  the  surface  during  TS 
than  NTS  periods,  TS  episodes  were  characterized  by  a  greater  proportion  of  time  spent 
occupying  the  depth  bins  between  5  and  25  m.  The  frequency  of  occupying  deeper  depths 
decreased  monotonically  for  TS  periods  and  non-monotonically  for  NTS  periods.  A  greater 
number  of  dives  between  10  and  20  m  occurred  during  TS  compared  to  NTS  periods 
(Figure  2.2,  top;  49%  vs.  21%).  This  tendency  of  TS  periods  to  contain  shallower  dives  was 
likely  due  to  the  whales  interacting  with  the  herring  ball  close  to  the  surface. 

Short  inter-dive  intervals  were  observed  more  frequently  and  deep  excursions 
occurred  more  quickly  during  TS  compared  with  NTS  periods  (Figure  2.2).  During  TS 
periods,  73%  of  inter-dive  intervals  (IDIs)  lasted  less  than  1  minute,  which  was  considerably 
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all  periods 
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depth  (m) 
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Figure  2.1.  Left:  cumulative  plot  of  depth  for  all  periods.  Percentage  of  total  time  spent  in  different  depth 
bins  during  tail  slapping  (center,  5.4h)  and  not  tail  slapping  (right,  26.7h)  periods.  Both  panels  are  bounded  by 
the  same  axis  limits. 


higher  than  the  value  of  19%  observed  during  NTS  periods  (Figure  2.2,  bottom).  There 
were  43  instances  in  which  IDIs  exceeded  4  minutes  during  NTS  periods  compared  to  none 
during  TS  periods.  The  variability  of  the  depth  did  not  reveal  any  consistent  differences 
across  animals  between  TS  and  NTS  periods  (Figure  2.3).  The  vertical  velocity,  change  in 
poindng  angle,  change  in  roll  movement  and  fluking  intensity  measures,  however,  were  more 
variable  during  TS  than  NTS  periods  for  these  animals  except  for  the  vertical  velocity  of  a 
single  whale  (Figure  2.3).  Because  the  time  surrounding  each  focal  tail  slap  was  withheld  to 
calculate  the  variability  of  the  movement  measures,  Figure  2.3  indicates  that  the  animals  were 
moving  more  variably  during  TS  segments  even  when  they  were  not  actually  producing  tail 
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Figure  2.2.  Top:  Histograms  of  maximum  depth  for  each  dive  exceeding  10m  for  tail  slapping  periods  (left) 
and  not  tail  slapping  periods  (right).  Bottom:  Histograms  of  inter-dive  intervals  between  all  dives  exceeding 
10m  for  tail  slapping  periods  (left)  and  not  tail  slapping  periods  (right).  Each  horizontal  pair  of  panels  is 
bounded  by  the  same  axis  limits. 


slaps.  Given  relatively  constant  running  mean  values  of  the  movement  measures,  this 
increased  variability  may  have  elevated  the  aerobic  demands  of  the  animals,  requiring  more 
frequent  trips  to  the  surface  to  breathe,  briefer  excursions  to  depth  and  a  consequent  need  to 
keep  the  herring  ball  near  the  surface.  (Of  course,  feeding  on  the  herring  ball  at  the  surface 
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Figure  2.3.  Standard  deviation  of  5  movement  measures  (see  text  for  details)  for  the  9  animals  exhibiting  both 
tail  slapping  (TS,  black  bars)  and  not  tail  slapping  (NTS,  white  bars)  periods.  The  panels  in  the  upper  and 
middle  left  contain  circular  datasets,  requiring  the  calculation  of  the  circular  standard  deviation  instead  of  the 
conventional  standard  deviation  used  for  the  remaining  panels.  The  first  four  characters  of  the  whale 
identification  labels  have  been  dropped  for  legibility.  The  lengths  of  the  time  periods  of  TS  and  NTS  periods 
are  available  in  Table  2.1. 


also  allowed  the  whales  to  trap  the  fish  against  the  air-water  interface.) 

Six  of  the  nine  whales  contained  at  least  one  TS  period  with  tail  slaps  that  were 
significandv  more  clustered  than  expected  if  they  were  distributed  uniformly  (Greenwood 
stadsdc).  This  suggests  two  different  temporal  regimes  for  tail  slapping  behavior:  one  in 
which  tail  slaps  clustered  in  time  (7  of  18  TS  periods,  1:  P  <  0.05;  2:  P  <  0.01;  4:  P  <  0.001) 
and  another  in  which  tail  slaps  were  produced  more  uniformly  in  time  (11  of  18  TS  periods). 
Group  tail  slap  rates  were  higher  than  previously  reported,  varying  between  0.7  and  9.6  per 
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minute  during  carousel  feeding.  Tail  slap  rates  from  individual  tagged  whales  ranged 
between  0.2  and  0.7  per  minute. 

2.5  Analysis  2:  Focal  tail  slapping 

The  tail  slap  is  a  distinctive  feature  of  carousel  feeding  and  it  offered  a  salient 
acoustic  and  movement  signature  pairing  when  produced  by  the  tagged,  focal  animal.  Here, 
I  quantify  the  more  detailed  movements  surrounding  and  characterizing  focal  tail  slaps. 

2.5.1  Methods 

Tail  slapping  individuals  undergo  a  concomitant  pitch  change  as  they  move  through 
a  partial  vertical  turn  resulting  from  the  momentum  of  the  slap  (Simila  &  Ugarte,  1993; 
Domenici  et  al.,  2000;  see  printed  and  supplementary7  video  material  from  Simon  et  al., 
2005).  The  subset  of  tail  slaps  meeting  an  individual-specific,  minimum  instantaneous 
change  in  pitch  threshold  were  assigned  to  the  tagged  animal  (i.e.,  considered  to  be  focal  tail 
slaps).  Figure  2.4  contrasts  the  consistent  change  in  pitch  associated  with  10  randomly 
selected  focal  tail  slaps  with  the  more  constant  pitch  values  of  10  randomly  selected  non- 
focal  tail  slaps.  Often,  focal  tail  slaps  were  preceded  by  elevated  flow  noise,  presumably  due 
to  water  moving  faster  over  the  tag  as  the  animal  fluked  into  position.  Just  prior  to  the  tail 
slap,  this  flow  noise  went  quiet.  These  acoustic  features  were  used  to  help  classify  focal  tail 
slaps  as  well.  The  time  stamps  of  the  focal  tail  slaps  were  centered  on  the  zero  crossing  of 
the  pitch  signal  because  this  cue  could  be  reliably  and  consistently  measured.  Alignment  at 
the  onset  of  the  percussive  acoustic  signal  would  have  been  more  variable  because  its 
occurrence  depended  on  the  relative  position,  concentration  and  incapacitation  of  the  fish. 
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Figure  2.4.  Pitch  values  associated  with  the  2s  flanking  10  randomly  selected  focal  (left)  and  non-focal  (right) 
tail  slaps  across  all  whales.  Note  the  consistent  directional  change  in  pitch  associated  with  the  focal  tail  slaps 
compared  to  the  flatter,  more  level  pitch  data  associated  with  the  non-focal  tail  slaps. 

The  relative  proportion  of  focal  to  non-focal  tail  slaps  was  quite  variable  (Table  2.1). 

2.5.2  Results  and  discussion 

Focal  tail  slaps  were  generally  louder  with  a  higher  signal-to-noise  ratio  than  non- 
focal  tail  slaps.  They  were  also  characterized  by  an  instantaneous  change  in  pitch  caused  by 
the  animals  spinning  from  a  negative  pitch  angle  (i.e.,  pointing  downwards)  to  an  opposite 
and  approximately  equal  positive  pitch  angle  (i.e.,  pointing  upwards,  Figure  2.4).  This  led  to 
a  high  kurtosis  for  the  distribution  of  pitch  angles  sampled  at  the  times  of  focal  tail  slaps 
with  most  values  inevitably  stacking  close  to  0°  (Figure  2.5a).  The  distribution  of  pitch 
angles  during  non-focal  tail  slaps  was  more  evenly  distributed  with  an  overall  preference  for 
generally  level  orientation  (Figure  2.5b).  This  trend  was  reversed  for  the  roll  data  (Figure 
2.5c-d).  Most  non-focal  tail  slaps  were  characterized  by  a  rather  level  roll  (^0°,  or  dorsal 
side  up)  but  all  orientations  were  observed.  During  focal  tail  slaps,  however,  the  animals 
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Figure  2.5.  Histograms  of  pitch,  roll  and  heading  measurements  (in  degrees)  taken  during  focal  tail  slap  events 
(left,  N  =  89)  and  non-focal  tail  slap  events  (right,  N  =  981).  Axis  bounds  are  identical  for  each  pair  of  focal 
and  non-focal  measurements.  Focal  tail  slaps  were  centered  on  the  zero  crossing  of  the  pitch  measurement 
(see  text  for  justification). 


were  rolled  between  ±90°  with  only  a  slight  preference  for  orientations  closer  to  0°. 
Although  the  orcas  rotated  their  bodies  somewhat  as  they  tail  slapped,  they  appeared  to  be 
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constrained  within  a  particular  range  of  orientations.  For  example,  no  inverted  tail  slaps 
(ventral  side  up)  were  detected,  suggesting  that  the  animals  were  not  approaching  from  the 
underside  of  the  ball.  This  preferred  set  of  roll  orientations  may  reflect  a  geometry  that 
delivers  sufficient  tail  slapping  force  to  incapacitate  the  herring  that  might  have  themselves 
adopted  a  particular  suite  of  favored  orientations  as  well.  No  differences  between  the 
heading  of  the  animals  during  focal  and  non-focal  tail  slaps  were  observed  (Figure  2.5e-t). 
There  was  only  one  occasion  in  which  a  tail  slap  that  was  accompanied  by  a  substantial 
change  in  heading  was  preceded  by  elevated  flow  noise  and  fluking  energy.  It  was  not 
included  here  as  a  focal  tail  slap.  This  tail  slap  may  have  reflected  an  alternative  movement 
strategy  to  tail  slap  fish  by  rotating  in  a  horizontal  plane  but  apparently  was  deployed  much 
less  frequently  than  the  tail  slaps  produced  by  changing  pitch. 

The  top  two  sets  of  triple  plots  in  Figure  2.6  plot  the  pitch,  depth  and  fluking 
intensity  profiles  for  an  adult  male,  oo05_322b  (top  row),  and  an  adult  female,  oo06_327s 
(middle  row),  during  their  focal  tail  slaps  (see  Appendix  1  for  similar  figures  for  the 
remaining  animals).  The  left  column  depicts  the  pitch  change  accompanying  tail  slaps 
produced  by  the  tagged  animal.  The  center  column  reveals  differences  in  the  diving  profiles 
associated  with  tail  slapping.  The  male  generally  started  at  the  surface  before  diving  to  10- 
20m  to  initiate  a  tail  slap  whereas  the  female  maintained  a  more  constant  depth  level,  often 
closer  to  the  surface,  before  and  during  her  tail  slaps.  IFe  column  to  the  far  right  indicates  a 
general  tendency  of  two  surges  in  fluking  intensity,  the  first  likely  corresponding  to  the 
animal  fluking  into  position  and/or  lunging  (dotted  line  bracket,  see  Introduction  and 
Domenici  et  al.,  2000  for  a  further  description)  and  the  second  to  the  tail  slapping  event 
itself:  (solid  line  bracket).  The  relative  amounts  of  fluking  energy  invested  in  the  lunge  and 
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Figure  2.6.  Top  two  trios  of  panels:  Pitch  (left),  depth  (center)  and  Az  variation  (right)  profiles  centered  on 
focal  tail  slaps  (gray  dotted  line)  produced  by  male  oo05_322b  (top  tnplet)  and  female  oo06_327s  (middle 
triplet).  Each  line  color  corresponds  to  a  unique  focal  tail  slap  and  is  consistent  across  each  horizontal  triptych 
of  plots.  Individual  oo05_322b  initiated  many  of  his  tail  slaps  by  surfacing  and  diving  to  10-20m  whereas 
oo06_327s  remained  at  a  more  constant  depth  for  nearly  all  of  her  tail  slaps.  Fluking  intensity  increased  during 
the  approach  or  lunge  phase  2-3s  preceding  the  tail  slap  (dotted  line  bracket)  and  then  once  the  tail  slap  was 
executed  (solid  line  bracket).  Bottom  panel:  Four  pairs  of  depth  profiles  surrounding  tail  slaps  produced  by 
group  members  oo05_322a  (dotted  line)  and  oo05_322b  (solid  line)  within  60s  of  one  another  (turquoise:  3s, 
green:  12s,  red:  39s,  blue:  54s). 
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tail  slap  phases  varied  between  animals  due  to  tag  placement  but  were  generally  similar 
within  individuals. 

On  one  occasion,  two  carousel  feeding  animals,  an  adult  female  (oo05_322a)  and  an 
adult  male  (oo05_322b),  were  tagged  simultaneously.  No  regular  patterns  were  observed  in 
the  pitch,  roll  and  heading  of  these  two  animals  to  indicate  that  they  were  synchronizing  or 
staggering  their  movement  behaviors  reliably  (data  not  shown  here).  The  bottom  panel  in 
Figure  2.6  compares  the  depth  profiles  associated  with  four  pairs  of  their  tail  slaps  in  the 
same  ball  of  herring  occurring  within  60s  of  one  another  towards  the  end  of  a  feeding 
episode  (the  remaining  pair  was  separated  by  83s).  Hach  painng  showed  a  similar  profile  in 
terms  of  both  the  change  in  depth  5s  preceding  the  tail  slap  to  the  moment  of  the  tail  slap 
(difference  ranged  from  0.5  to  3.5  m  with  .v  -  1.6  m)  and  the  actual  depth  of  the  tail  slaps 
(difference  ranged  from  0.4  to  10.9  m  with  x  =  4.7  m).  Given  the  male’s  tendency  to 
surface  before  diving  to  5-20  m  to  tail  slap  (Figure  2.6,  middle  panel  of  top  row),  this 
matching  of  the  deeper,  more  level  trajectories  was  especially  striking.  This  strong  overlap 
therefore  suggests  that  the  location  and  geometry  of  the  herring  ball  were  primarily 
influencing  the  details  of  tail  slapping  movements.  'ITie  difference  in  profiles  between  this 
male  and  the  other  adult  female  (oo()6_327s,  middle  set  of  panels,  Figure  2.6)  that 
maintained  a  more  level  depth  profile  surrounding  her  tail  slaps,  then,  were  less  likely  to  have 
resulted  from  individually-stereotyped  behaviors. 

The  depths  of  focal  tail  slaps  and  the  changes  in  depth  associated  with  5s  before  to 
the  instant  of  the  tail  slap  are  plotted  in  Figure  2.7  for  the  9  whales  that  produced  their  own 
tail  slaps.  In  light  of  the  earlier  discussion,  the  differences  observed  here  were  most  likely- 
determined  by  the  depth  of  the  hernng  ball.  Female  oot)5_322a  produced  only  5  tail  slaps 
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Figure  2.7.  Box  and  whisker  plots  depicting  the  depths  of  focal  tail  slaps  (top)  and  the  change  in  depth  from 
5s  preceding  the  tail  slap  to  the  moment  of  the  tail  slap  (bottom;  positive  values:  descents;  negative  values: 
ascents)  for  all  9  whales.  The  horizontal  lines  of  each  box  correspond  to  the  lower  quartile,  median  and  upper 
quartile  values  of  each  dataset.  Whiskers  show  the  extent  of  the  data  and  outliers  are  indicated  (+).  The 
sample  size  for  each  of  these  bars  is  given  in  the  penultimate  column  of  Table  2.1. 


over  a  much  shorter  time  period  at  the  very  end  of  the  feeding  sequence  whereas  the 
simultaneously  tagged  male  oo05_322b  produced  30  tail  slaps  throughout  the  carousel 
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Figure  2.8.  Simultaneous  dive  profiles  of  r\vo  carousel  feeding  whales  oo05_322a  and  oo()5_322b,  zoomed 
into  tail  slapping  activity.  Colored  circles  indicate  focal  tail  slaps  (see  legend). 

feeding  episode.  Figure  2.8  shows  the  diving  sequences  and  timing  of  the  focal  tail  slaps  of 
these  two  simultaneously  tagged  animals.  Both  whales  worked  the  vertical  dimension 
actively,  even  as  they  fail  slapped,  hut  did  so  non- synchronously,  presumably  to  preserve  the 
coherence  of  the  prey  ball  (see  Chapter  3).  It  is  possible  that  killer  whales  consumed  the  fish 
that  were  incapacitated  as  a  result  of  their  own  tail  slaps  (see  Simila  &  Ugarre,  1993; 
Domenici  et  al.,  2000).  If  that  were  true,  the  female  in  the  carousel  here  could  have 
consumed  six  rimes  fewer  fish  than  the  male. 

2.6  Analysis  3:  3D  tracks  and  behavioral  sequences 

'ihis  section  considers  the  behavioral  sequences  that  lead  to  successful  foraging 
episodes.  Carousel  feeding  has  been  described  as  essentially  a  two-stage  process  involving 
corralling  herring  from  depth  and  feeding.  More  specifically,  corralling  has  been  associated 
previously  with  animals  first  driving  herring  to  the  surface,  which  required  successively 
shallower  dives,  and  then  herding  the  fish  into  a  compact  ball  (Simila  &  Ugartc,  1993; 
Nottestad  &  Simila,  2001;  Nottestad  et  al.,  2002).  Feeding  consists  of  circling  about  this 
condensed  herring  ball  at  the  surface  and  occasionally  breaking  to  tail  slap  and  feed  (Simila 
&  Ugarre,  1993;  Domenici  et  al.,  2000;  Simon  et  al.,  2005).  The  index  of  2D  travel 
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introduced  in  this  section  relied  explicitly  on  the  movement  data  to  divide  the  dataset  into 
periods  of  low  and  high  circling.  Using  the  data  streams  from  all  three  analyses,  I  examined 
the  carousels  in  light  of  the  corralling  and  feeding  behaviors  previously  described. 

2.6.1  Methods 

The  PRH  data  allowed  the  calculation  of  a  pseudo-track,  a  non-geo-referenced 
inertial  track  of  the  animal’s  2D  movement  path  that  assumed  a  constant  swimming  speed 
and  current  velocity  (see  Johnson  &  Tyack,  2003;  Zimmer  et  al.,  2005).  The  third  dimension 
of  depth  was  measured  directly  with  the  pressure  sensor.  The  pseudo-track  was  not  useful 
for  calculating  absolute  distances  traveled  since  geo-referencing  based  on  both  imperfect 
visual  sightings  and  the  assumption  of  constant  velocity  would  have  introduced  large  errors. 
It  did  allow  relative  measurements  to  be  made,  however,  such  as  track  tortuosity.  I  sought  a 
path-based  metric  of  tortuosity  and  used  a  linearity  index  (LI)  that  was  calculated  and 
smoothed  over  a  sliding  30s  window  by  dividing  the  pseudo-distance  between  the  beginning 
and  ending  of  this  section  of  the  path  (the  crow’s  flight)  by  the  pseudo-path  covered  by  the 
whale  (see  Wilson  et  al,  2007).  While  the  entire  pseudo-track  accumulated  error  over  the 
full  recording,  the  LI  measurement  did  so  only  over  the  length  of  the  short  smoothing 
window  selected  to  capture  path  variation  on  a  tight  time  scale.  For  this  measure,  1 
corresponded  to  a  straight  path  (i.e.,  the  crow’s  path  was  identical  to  the  whale’s  path  and 
roughly  expected  when  the  animal  was  swimming  directionally)  and  0  corresponded  to 
absolute  circuitousness  (i.e.,  no  displacement  since  the  beginning  and  ending  points  were  the 
same).  The  LI  was  smoothed  again  with  a  running  average  filter  and  the  final  measure 
excluded  the  first  and  last  several  minutes  of  the  record  to  accommodate  the  lag  of  the  filter. 
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Dive  duration,  maximum  depth  and  IDI  were  examined  in  relation  to  the  LI  as  well. 

2.6.2  Results  and  discussion 

The  TS  versus  NTS  distinction  established  in  Analysis  1  relied  on  acoustic  evidence 
of  tail  slaps  to  divide  the  dataset.  In  this  analysis,  I  examined  whether  the  LI,  which  derived 
directly  from  the  movement  data,  justified  some  kind  of  division  as  well.  Figure  2.9a  plots  a 
histogram  of  the  linear  index  (Id)  sampled  once  even-  second  across  all  animals.  All  of  the 
LI  data  were  greater  than  0.37  and  84%  were  larger  than  0.80.  The  LI  values  corresponding 
to  82  of  the  89  focal  tail  slaps  (the  remaining  7  tail  slaps  occurred  within  the  4  minute  lag  at 
the  beginning  and  ending  of  each  record  created  by  the  smoothing  window)  are  accumulated 
in  Figure  2.9b.  A  normal  distribution  was  fit  to  these  data  (//  —  0.59,  o  —  0.09)  and  plotted  in 
both  panels  of  Figure  2.9.  It  is  evident  that  focal  tail  slaps  occurred  during  segments  of  the 
record  that  were  characterized  by  lower  linearity  (i.e.,  higher  circuitousncss).  I  used  this 
second  distribution  to  motivate  dividing  the  data  into  low  and  high  LI  segments  of  at  least  2 
minutes  in  duration  using  a  threshold  of  0.76,  the  LI  value  exceeding  95%  of  the  focal  tail 
slap  data.  Compared  with  a  value  exceeding  all  of  the  focal  tail  slap  data,  0.76  was  more 
likely  to  classify  intermediate  measurements  near  the  threshold  as  high  LI.  There  was  a 
slight  distinction  between  dividing  the  data  into  TS  and  NTS  episodes  (Analysis  1)  and  into 
low  and  high  LI  periods  (here).  In  particular,  the  LI  division  included  periods  of  high 
circuitousness  that  may  not  have  resulted  in  successful  focal  tail  slaps  (e.g.,  see  first  low  LI 
episode  in  Figure  2.12).  TS  episodes  were  generally  slightly  longer  than  low  1.1  periods, 
suggesting  that  the  tight  circling  movements  of  the  focal  animal  (low  1 .1)  lasted  longer  than 
the  time  span  in  which  group  members  were  producing  audible  tail  slaps  (l’S  periods). 


67 


(a)  all  tags 


high  circuitousness 

(b)  focal  tail  slaps 


high  linearity 


Figure  2.9.  Top  panel:  Histogram  of  linearity  index  (LI)  gathered  once  even*  second  from  all  whales.  Bottom 
panel:  Histogram  of  LI  values  at  moments  of  focal  tail  slaps  (N  —  82).  A  normal  distribution  was  fit  to  the 
bottom  histogram  and  is  plotted  in  both  panels  as  the  dark  line  (//  =  0.59;  o-  0.09).  The  LI  threshold  of  0.76 
is  plotted  in  red  in  the  top  panel  and  was  computed  to  contain  95%  of  the  data  in  the  lower  panel.  This 
threshold  split  the  data  into  low  (<  0.76)  and  high  (>  0.76)  linearity. 


Table  2.2  provides  behavioral  sequence  narrations  for  all  1 3  animals  based  on 
pseudo-track,  LI  and  direct  and  derived  movement  measurements.  The  derived  movement 
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Tag 

Behavioral  sequence  narration 

oo05_316a 

The  deepest  dive  of  49m  was  followed  by  several  dives  that  became  progressively  shallower.  No 
looping  in  the  track  or  tail  slap  activity  was  observed.  Onlv  high  Li  activity  was  evident. 

oo05_320a 

Periods  of  directional  movement  alternated  with  broad  looping  behavior  for  the  duration  of  this 
recording.  The  first  period  of  broad  looping  occurred  at  the  very  beginning  of  the  record  with  a 
concomitant  shoaling  of  maximum  dive  depth.  Only  a  single  non-focal  rail  slap  was  observed 
during  the  entire  recording.  I  Iigh  LI  activity  characterized  the  entire  tagout. 

oo05_320b 

Two  periods  of  low  LI  and  tail  slapping  at  the  very  beginning  and  ending  of  this  record  were 
separated  by  a  long  episode  of  more  directional  movement.  No  obvious  shoaling  of  maximum 
dive  depth  was  apparent. 

oo05_321a 

The  first  period  of  this  record  was  highly  circuitous  and  contained  tail  slapping  activity.  This  was 
followed  by  an  episode  of  high  LI  containing  two  periods  of  directional  movement  separated  bv 
a  section  of  broad  looping.  The  deepest  dive  occurred  at  the  beginning  of  the  record  and  the 
maximum  dive  depths  became  shallower  with  time.  No  focal  tail  slapping  was  observed  after  the 
initial  high  LI  episode. 

oo05_321b 

This  animal  was  tagged  as  it  traced  out  broad  loops,  corresponding  to  the  same  looping  behavior 
of  oo05_321a.  The  remainder  of  the  track  was  characterized  by  travel  in  a  consistent  direction. 
Dive  durations  and  depths  were  fairly  uniform  for  the  entire  tagout.  The  LI  gradually 
approached  1. 

oo05_322a 

A  long  period  of  initial  directional  movement  was  followed  by  an  episode  of  circling  initiated  by 
the  deepest  dive  of  the  record.  The  maximum  dive  depth  became  progressively  shallower.  The 
low  LI  period  involved  tight  circling  behavior  and  a  handful  of  focal  tail  slaps.  This  was  followed 
by  broad  looping  and  the  record  ended  with  a  return  to  high  LI  directional  swimming. 

oo05_322b 

An  initial  episode  of  broad  looping  transitioned  into  a  tail  slapping  and  low  LI  phase  with 
numerous  focal  tail  slaps.  "Ihe  deepest  dive  of  the  record  occurred  as  the  tail  slapping  was  getting 
underway.  The  remainder  of  the  tagout  was  characterized  by  consistently  directed  swimming 
interspersed  with  occasional  broad  loops. 

oo()5_324a 

This  whale  swam  consistently  towards  the  west  with  right  angle  changes  of  direction.  No  tail 
slaps  or  suggestive  diving  patterns  were  observed. 

oo06_313s 

A  long  period  of  initial  directional  swimming  and  an  occasional  small  loop  involving  an  earh 
deep  dive  was  followed  by  a  brief  period  of  broad  looping  before  a  quick  episode  of  low  LI  and 
focal  tail  slapping.  'Ihe  record  ended  with  the  animal  returning  to  high  LI  movement. 

oo06_314a 

This  animal  moved  directionally  for  the  majority  of  the  record  until  beginning  a  low  LI  episode  at 
the  very  end  accompanied  by  focal  tail  slapping. 

oo06_314s 

An  initial  period  of  high  LI  ended  with  a  sharp  180°  turn  and  deep  dive,  after  which  tail  slapping 
accompanied  a  low  LI.  A  short  episode  of  subsequent  broad  looping  was  followed  by  another 
longer  period  of  directional  travel.  One  focal  rail  slap  was  produced  during  the  first  high  LI 
segment. 

oo06_317s 

An  initial  episode  of  rail  slapping  and  low  LI  transitioned  into  a  high  LI  phase  characterized  by  a 
single  deep  dive  and  broader  looping  behavior.  Directional  swimming  occurred  subsequently. 

oo06_327s 

Although  nearly  the  entire  record  of  this  animal  was  characterized  by  low  LI,  periods  of  high  uid 
moderate  circling  were  both  present.  The  looping  at  the  beginning  was  accompanied  by  the 
deepest  dives  compared  to  later  in  the  sequence. 

Table  2.2.  Behavioral  sequence  narrations  for  all  animals  based  on  track,  LI  and  diving  data.  The  LI  threshold 
of  0.76  divided  the  records  into  low  LI  (tight  circling  generally  accompanied  by  focal  and  non-focal  tail  slaps) 
and  high  LI  (directional  swimming  and  broad  looping  behavior)  periods. 

measures  tended  to  vary  considerably  during  low  LI  periods  and  moderately  or  minimally 
during  high  LI  periods.  Table  2.3  lists  the  time  intervals  corresponding  to  periods  of  low 
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Tag 

LI 

Time  interval  (minutes) 

Time  interval  (local  time) 

mean  LI 

oo()5 316a 

high 

4-158 

12:09  -  14:42 

0.95 

oo05 320a 

high 

4-278 

8:49-13:23 

0.91 

low 

4-15 

12:42  -  12:53 

0.62 

high 

15-86 

12:53-14:05 

0.97 

oo05_320b 

low 

86  -  102 

14:05  _  14:20 

0.68 

high 

102-107 

14:20  -  14:25 

0.80 

low 

107-128 

14:25  -  14:46 

0.63 

low 

4-15 

10:45-10:56 

0.71 

oouo _ jz i a 

high 

1 5  -  245 

10:56  -  14:45 

0.94 

oo05 321b 

high 

4-96 

11:52-13:25 

0.92 

high 

4-66 

10:14-11:16 

0.94 

low 

66-72 

11:16-11:22 

0.70 

oo05_322a 

high 

72  -  126 

11:22-12:16 

0.89 

low 

126-166 

12:16-12:56 

0.66 

high 

166-211 

12:56-13:41 

0.91 

low 

4-9 

11:31  -11:36 

0.65 

high 

9-27 

11:36-11:54 

0.87 

low 

27-30 

11:54-11:56 

0.74 

oo05_322b 

high 

30  -  35 

11:56-12:01 

0.81 

low 

35  -  39 

12:01  -  12:05 

0.73 

high 

39  -  42 

12:05  -  12:07 

0.79 

low 

42-90 

12:07-12:56 

0.59 

high 

90  -  179 

12:56  -  14:25 

0.94 

oo05 324a 

high 

4-  126 

13:39-15:41 

0.98 

high 

4-95 

13:42-15:13 

0.92 

oo06_313s 

low 

95  -  99 

15:13-15:17 

0.73 

high 

99  -  128 

15:17-15:45 

0.96 

high 

4-113 

10:37  -  12:27 

0.96 

OOUO_j  L^fA 

low 

113-121 

12:27  -  12:34 

0.65 

high 

4-43 

12:00-12:39 

0.91 

low 

43  -  45 

12:39  -  12:41 

0.74 

oo06_314s 

high 

45  -  48 

12:41  -  12:44 

0.83 

low 

48-64 

12:44-13:00 

0.63 

high 

64-120 

13:00  — 13:56 

0.91 

lllc 

low 

4-17 

14:37  -  14:50 

0.59 

OOUO_J 1 / s 

high 

17-34 

14:50-15:06 

0.87 

oo()6_327s 

low 

4  -  50 

13:47-14:32 

0.58 

Table  2.3.  Time  intervals  for  low  and  high  LI  periods  of  at  least  2  minutes  in  duration,  excluding  the  first  and 
List  4  minutes  because  of  the  smoothing  window  of  a  filter  (see  text). 


and  high  LI  and  their  mean  LI  values. 


Figures  2.10-2.13  visually  summarize  the  two  general  types  of  behavioral  sequences 
of  the  animals  observed  preceding  and  during  carousel  feeding  (see  Appendix  1  for  the  full 
tracks  of  all  13  animals).  The  first  involved  a  period  of  directional  swimming  followed  by 
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Figure  2.10.  Partial  pseudo-track  (non-geo-referenced  inertial  path)  of  male  oo05_320b  (1.3h  segment). 

1  lonzontal  movement  is  plotted  as  relative  distance  and  the  beginning  and  ending  of  this  segment  are  shown  (□ 
and  o,  respectively).  In  the  left  panel,  depth  is  colorized,  the  filled  gray  circles  represent  tail  slaps  produced  bv 
a  non-tagged  individual,  and  the  red  circles  indicate  tail  slaps  produced  by  the  tagged,  focal  animal.  In  the  right 
panel,  the  red  sections  of  the  track  correspond  to  periods  of  low  linearity  (rhresholded  using  the  focal  tail  slap 
data,  see  text  and  Figure  2.9).  This  path  reveals  movement  characterized  by  high  linearity  and  directional 
swimming  leading  into  movement  characterized  by  low  linearity. 


heavy  tail  slapping  activity.  The  track  segment  in  Figure  2.10  depicts  an  adult  male 
(oo05_320b)  that  initially  swam  southwards  until  making  a  sharp  hairpin  turn  to  the  north. 
'Ihis  period  of  movement  was  characterized  by  high  LI  (gray  in  the  right  panel)  and  an 
absence  of  local  tail  slaps.  Subsequently,  the  male  began  a  period  of  tight  circling  and  his 
path  became  highly  circuitous  (red  in  the  right  panel).  'Lhe  elevated  rates  of  focal  and  non 
local  tail  slap  production  were  indicative  of  feeding  behavior  (Similii  &  Ugarte,  1993; 
Domenici  et  al.,  2000;  Simon  et  al.,  2005). 

The  LI,  dive  profile  and  derived  movement  measurements  for  this  male  are  plotted 
in  Figure  2.1 1  (see  Appendix  1  to  view  these  plots  for  the  remaining  animals).  The  1.1 
oscillated  between  lower  values  during  TS  periods  and  a  higher  value  during  the  NTS  period. 
'Ihe  derived  movement  measurements  all  exhibited  increased  activity  and  variability  during 
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Figure  2.11.  Movement  data  summary  plots  for  male  oo05_320b.  The  linearity  index  (LI)  is  plotted  as  a  thin 
continuous  black  line  in  the  two  uppermost  panels  and  ranges  from  0  (no  displacement)  to  1  (straight  path). 
The  LI  plots  two  rough  states:  a  low  linearity  (high  circuitous)  state  when  the  animals  were  tail  slapping  and  a 
high  linearity  (low  circuitous)  state  when  few  tail  slaps  were  observed  (see  Figure  2.9).  Dive  duration  and 
maximum  dive  depth  are  indicated  with  the  magenta  squares  and  red  circles  in  the  left  and  right  panels, 
respectively.  The  small  gray  squares  and  open  black  circles  on  the  LI  curve  indicate  non-focal  and  focal  tail 
slaps,  respectively.  The  two  tail  slapping  (TS)  periods  are  indicated  by  the  black  horizontal  bars  at  the  top  of 
the  plot.  The  low  LI  episodes  are  shown  by  the  blue  horizontal  bars.  The  two  small  triangles  at  the  bottom  of 
the  plots  mark  the  time  interval  plotted  in  Figure  2.10.  Beginning  in  the  second  row  and  reading  left  to  right, 
the  remaining  panels  plot  the  change  in  pitch  (degrees),  depth  (m),  change  in  roll  (degrees),  vertical  velocity 
(m/ s),  residual  heading  (degrees)  and  variation  in  the  z-axis  of  the  accelerometer  (a  proxy  for  fluking  energy, 
relative  units).  See  text  for  computation  details.  The  change  in  pitch,  change  in  roll,  residual  heading  and  Az 
variation  all  increased  during  TS  periods.  Time  is  reported  locally  and  runs  identically  along  the  x-axis  of  each 
panel. 
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TS  periods  compared  to  NTS  periods  (see  Figure  2.3  as  well).  The  percussive  tail  slapping 
sounds,  echolocation  clicks  and  numerous  pulsed  calls  make  feeding  an  acoustically  active 
time  (Simula  &  Ugarte,  1993;  Van  Opzeeland  et  al.,  2005;  Simon  et  al.,  2007),  which  may 
attract  nearby  groups  of  killer  whales  to  a  ball  of  herring  that  has  already  been  brought  to  the 
surface.  This  may  explain  the  sudden  change  in  direction  by  the  male  here  since  focal  tail 
slapping  activity  ensued  suddenly  after  roughly  18  minutes  of  directional  swimming 
following  the  turn.  The  possible  strategy  of  feeding  on  a  previously  corralled  herring  school 
may  offer  an  opportunity  for  these  animals  to  forego  herding  the  fish  at  depth  and 
participate  only  in  the  feeding  endgame.  Indeed,  visual  observations  confirmed  that  this 
male  and  his  group  joined  another  group  that  was  already  carousel  feeding.  Two  other 
animals  displayed  a  similar  pattern  in  which  a  highly  circuitous  path  coupled  with  tail 
slapping  activity  was  preceded  by  a  low  LI  period.  In  one  instance  (oo06_314a),  the 
direction  of  movement  remained  consistent  but  in  the  other  (oo()6_314s),  a  rapid  change  in 
direction  preceding  the  tail  slapping  activity  similar  to  oo05_320b  was  observed.  No 
additional  visual  observations  of  the  presence  or  behavior  of  nearby  groups  were  available 
for  these  other  two  whales. 

When  carried  through  effectively,  the  second  type  of  foraging  sequence  involved 
broad  horizontal  looping  followed  by  tail  slapping  and  feeding.  The  path  of  the  female 
oo05_322a  (Figure  2.12)  exemplified  this  behavioral  routine  as  it  transitioned  from  high  to 
low  LI  (after  an  initial  low  LI  episode).  In  contrast  to  oo05_320b,  the  high  LI  period 
contained  numerous  broad  loops  accompanied  by  moderate  to  deep  dives  to  20  -  40  m. 
rIhe  absence  of  tail  slapping  activity  suggested  that  no  feeding  wras  taking  place.  The  diving 
depths  gradually  shoaled  after  the  deepest  dive  to  123  m  in  this  record.  This  use  of  3D 
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Figure  2.12.  Partial  pseudo-track  of  female  oo05_322a  (1.6h  segment).  Key  as  in  Figure  2.10.  The  right  panel 
shows  that  the  high  circuitousness  sections  (red)  accompanied  by  tail  slapping  were  interspersed  with  a  high 
linearity  section  (gray)  that  contained  vertical  excursions  and  broad  horizontal  looping  but  lacked  frequent  rail 
slapping. 


space  was  consistent  with  the  corralling  behaviors  previously  described  (Simila  &  Ugarte, 
1993;  Nottestad  &  Simila,  2001;  Nottestad  et  al.,  2002).  In  particular,  this  animal  may  have 
traveled  to  depth  to  locate  herring,  brought  them  to  the  surface  after  a  set  of  repeated  and 
increasingly  shallow  dives,  and  contained  them  by  broadly  looping  about  the  circumference. 

The  subsequent  tighter,  low  LI  loops  were  again  accompanied  by  numerous 
instances  of  non-focal  and  focal  tail  slapping  (Figure  2.12).  The  derived  movement 
measurements  registered  increased  activity  levels  during  the  low  LI  phase  (Figure  2.13).  The 
other  period  at  — 11 :30  exhibiting  elevated  activity  corresponded  to  another  section  of  tight 
circling  and  numerous  non-focal  tail  slaps  displayed  as  the  short  first  span  of  red  in  Figure 
2.12.  Three  other  whales  contained  similar  sequences  of  transitions  from  broad  to  tight 
horizontal  looping  (oo05_322b,  oo06_313s,  oo06_327s),  which  may  have  reflected  that  the 
herring  ball  required  restructuring  by  the  group  before  feeding  could  progress.  The  two 
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Figure  2.13.  Movement  data  summary  plots  for  female  oo05_322a.  See  Figure  2.1 1  for  kev.  The  TS  period 
was  flanked  by  two  NTS  periods.  The  two  small  triangles  at  the  bottom  of  the  top  two  sub-panels  mark  the 
time  interval  plotted  in  Figure  2.12. 


animals  that  were  tagged  simultaneously,  the  adult  female  oo05_322a  and  the  adult  male 
oo05_322b,  traced  our  similar  paths  of  low  LI  leading  to  high  LI  (see  Appendix  1), 
suggesting  that  closely  coordinated  and  sequenced  movements  may  be  essential  for  group 
members  to  contain  and  feed  on  herring. 

In  addition  to  oo05_322a,  7  other  animals  displayed  finis  broad  looping  behavior 
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during  which  only  occasional  focal  tail  slaps  were  produced  (i.e.,  oo05_320a,  oo05_32ta, 
oo05_321b,  oo06_317s),  the  putative  corralling  occurred  after  successful  tail  slapping  and 
feeding  had  been  replaced  by  directional  swimming.  The  episodes  of  putative  corralling 
displayed  by  3  other  animals  (oo05_321a,  oo06_313s,  oo06_314s)  contained  non-focal  tail 
slaps,  suggesting  either  that  they  were  produced  by  a  more  distant  non-focal  group  or  that 
the  animals  may  have  taken  turns  between  corralling,  in  which  group  members  including  the 
focal  animal  preserved  the  integrity  of  the  herring  ball,  and  tail  slapping.  This  kind  of 
coordinated  turn  taking  would  prevent  the  animals  from  feeding  all  at  once,  a  scenario  that 
could  lead  to  the  dispersal  of  the  fish  ball.  To  test  for  turn  taking,  sufficient  numbers  of 
animals  need  to  be  tagged  simultaneously  in  future  studies  to  capture  all  of  the  behaviors 
that  may  be  co-occurring  and  examine  the  numbers  and  patterns  of  animals  participating  in 
each  task. 

In  7  cases,  broad  horizontal  looping  did  not  always  lead  to  tail  slapping  and  feeding. 
Four  animals  interspersed  periods  of  directional  swimming  with  periods  of  putative 
corralling  (e.g.,  oo()5_320a,  oo05_321a,  oo05_321b,  oo()6_317s).  Anti-predator  responses  of 
herring  have  been  documented  (Nottestad  &  Axelsen,  1 999)  and  it  is  probable  that  herring 
take  advantage  of  momentary  breaks  in  orca  group  formation  to  disperse  laterally  and  dive, 
rendering  their  geometry  and  vertical  position  more  challenging  for  tail  slapping.  In 
addition,  certain  fish  densities  and  quantities  may  be  insufficient  to  merit  further  shoaling, 
leading  the  orcas  to  abort  their  efforts  prematurely  (Nottestad  et  al.,  2002).  A  subset  of  the  7 
whales  remained  closer  to  the  surface  after  a  successful  tail  slapping  episode,  suggesting  that 
they  may  have  been  trying  to  re-corral  the  ball  upon  which  the  group  had  been  feeding.  The 
data  presented  here  reveal  that  putative  corralling  efforts  may  go  unrewarded  rather 
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regularly.  This  would  make  the  strategy  of  feeding  upon  herring  that  require  minimal 
corralling  (since  they  have  already  been  gathered  close  to  the  surface  by  the  focal  or  non- 
focal  group)  all  the  more  advantageous. 

2.7  General  discussion 

The  data  presented  here  provide  a  detailed  portrait  of  the  movement  behavior  of 
individual  Norwegian  killer  whales  as  they  forage  in  groups  on  herring.  Previous  work 
described  carousel  feeding  episodes  consisting  of  a  deeper,  gradually  shoaling  corralling 
phase  followed  by  a  shallower  feeding  phase  (Simila  &  Ugarte,  1993;  Nottestad  &  Similii, 
2001;  Nottestad  et  al.,  2002).  Using  a  threshold  calculated  from  the  distribution  of  linearity 
index  (LI)  values  taken  at  the  moments  of  focal  tail  slaps  (Figure  2.9),  the  recordings  were 
examined  with  these  phases  in  mind.  High  LI  periods  mapped  onto  segments  of  track 
characterized  by  either  directional  swimming  or  broad  horizontal  looping.  Both  involved 
moderate  to  deep  vertical  excursions  and  occasional  non-focal  tail  slaps.  Directional 
swimming  and  the  broad  looping  seemed  to  fit  the  putative  behavioral  categories  of  “travel” 
and  “corralling,”  respectively.  Low  LI  segments  involved  tight  circling  and  contained 
numerous  instances  of  focal  tail  slapping  that  likely  indicated  feeding  activity. 

Low  LI  periods  were  by  definition  associated  with  more  variable  3D  movement 
features.  With  the  exception  of  a  single  measurement  for  one  individual,  the  vertical 
velocity,  fluking  energy  and  changes  in  pointing  angle  and  roll  were  highly  variable  (Figures 
2.3,  2.1 1  &  2.13).  This  is  consistent  with  the  frequent  changes  in  orientation  of  these 
animals  as  they  feed  that  have  been  observed  (Simila  &  Ugarte,  1993;  Domenici  et  al.,  2000; 
Simon  et  al.,  2005).  To  produce  effective  tail  slaps  and  to  compensate  for  the  dynamic 
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responses  of  both  conspecifics  and  the  herring  ball,  the  killer  whales  must  vary  and  make 
constant  adjustments  to  their  3D  orientation  and  position.  Additional  data  from  whales 
engaged  across  multiple  carousel  feeding  events  are  required  to  explore  how  the  animals 
balance  the  individual  need  to  feed  with  the  group  need  to  preserve  the  coherence  of  the 
herring  ball. 

A  subset  of  the  tagged  whales  displayed  a  carousel  sequence  consistent  with  the 
observations  of  earlier  research.  In  particular,  the  broad  horizontal  looping,  deeper  vertical 
excursions  and  high  LI  that  defined  putative  corralling  in  the  tag  data  gave  way  to  the  tight 
circling,  abundant  focal  and  non-focal  tail  slaps  and  low  LI  of  feeding  (Figures  2.12  &  2.13). 
Data  from  other  whales,  however,  suggested  that  it  was  not  uncommon  for  putative 
corralling  to  be  required  minimally  or  not  at  all  as  animals  began  feeding  on  herring  already 
at  the  surface  (Figures  2.10  &  2.11).  The  herring  follow  diurnal  vertical  migration  patterns, 
remaining  at  depth  during  the  daytime  to  avoid  visual  predators  and  traveling  to  the  surface 
at  night  to  feed.  This  made  it  doubtful  that  the  fish  first  located  at  the  surface  by  the  killer 
whales  had  shoaled  on  their  own.  Rather,  it  was  more  likely  that  they  had  escaped  from  an 
earlier  corralling  effort  organized  by  the  focal  or  non-focal  group  moments  earlier. 

The  order  and  timing  of  these  behaviors  and  movements  varied  and  likely  depended 
on  the  circumstances  of  the  actual  feeding  event.  For  example,  tail  slapping  did  not  always 
follow  putative  corralling.  No  cues  in  the  movement  data  consistently  anticipated  transitions 
from  one  behavior  to  the  next.  Rather,  these  stages  may  be  coordinated  vocally,  a  topic  that 
will  be  considered  in  Chapter  4.  Carousel  feeding  demands  a  significant  energetic  input  as 
evidenced  by  the  elevated  fluking  energy  and  more  variable  movement  measures  during 
feeding  periods  (Figures  2.3,  2.1 1  &  2.13).  The  data  presented  here  have  indicated  that  the 
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cost:benefit  ratio  may  be  even  higher  since  corralling  efforts  do  not  always  lead  to  actual 
feeding  episodes.  Furthermore,  it  is  likely  that  carouseling  should  yield  a  larger  energetic 
gain  for  killer  whale  groups  that  locate  herring  that  have  already  been  corralled  or  that  are 
closer  to  the  surface  due  to  bathymetry  or  fish  behavior  since  less  effort  is  required  to  initiate 
the  feeding.  The  extent  to  which  turn  taking  or  general  behavioral  state  synchrony  and 
different  tail  slap  timing  and  energetic  patterns  are  preferred  may  depend  less  on  the  killer 
whales  and  more  on  the  immediate  demands  imposed  by  the  movements  of  a  particular  fish 
school.  This  seems  especially  likely  given  that  otherwise  distinct  diving  profiles  produced  by 
the  two  synchronously  tagged  animals  became  more  similar  when  preceding  their  closelv 
spaced  tail  slaps  (Figure  2.5,  see  earlier  discussion). 

Carousel  feeding  by  Norwegian  killer  whales  shares  features  common  to  group 
foraging  behaviors  documented  in  other  animal  species.  First,  their  prey  source  is  abundant: 
a  vast  amount  of  herring  biomass  entered  the  fjords  each  winter  (Simila  et  al.,  1996). 
Secondly,  the  herring  possess  the  advantages  of  greater  maneuverability  and  acceleration 
compared  to  the  killer  whales  (Domenici  et  al.,  2000;  Domenici,  2001).  These  two 
characteristics  of  the  prey  field  make  carousel  feeding  an  especially  productive  strategy  for 
the  orcas  since  as  a  collective,  they  can  exploit  a  food  source  that  is  otherwise  difficult  to 
capture.  The  benefits  of  group  foraging  have  likely  contributed  at  least  partially  to  the 
development  of  the  highly  social  nature  of  these  animals,  offsetting  the  benefits  of  a  more 
solitary  lifestyle. 

Together,  these  results  point  out  that  carousel  feeding  is  dynamic  and  opportunistic, 
requiring  animals  to  secure  herring  at  depth  or  closer  to  the  surface,  herd  fish  schools  that 
may  or  may  not  have  been  already  corralled  by  another  group,  and  ensure  the  spatial 
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integrity  of  the  ball  for  as  long  as  feeding  is  taking  place.  The  animals  worked  frequendy  to 
corral  fish  (both  unsuccessfully  and  successfully)  into  a  preferred  configuradon,  presumably 
organizing  their  behaviors  to  avoid  interfering  with  one  another  when  taking  advantage  of 
tail  slapping  and  feeding  opportunities  .  Although  the  putative  corralling  behavior  was  not 
evident  from  visual  observation  alone  because  it  either  occurred  at  depth  or  resembled 
feeding  at  the  surface,  the  tag  movement  data  helped  to  distinguish  this  behavior.  The 
common  movement  and  focal  tail  slapping  patterns  observed  across  animals  during  putative 
feeding  periods  implied  that  the  tagged  whales  all  participated  in  the  carousel  and  received  an 
opportunity  to  eat. 

To  ascertain  whether  carousel  feeding  is  truly  cooperative,  however,  more  work  is 
required  to  demonstrate  that  all  of  the  animals  in  a  group  contribute  consistendy  to  the 
foraging  effort  and  that  each  individual  is  eating.  This  study  strongly  motivates  future 
tagging  studies  of  Norwegian  killer  whales  combined  with  simultaneous  sonar  or  video 
observations  of  herring  to  characterize  the  dynamics  between  predator  and  prey.  Longer, 
synchronous  tag  deployments  would  allow  comparisons  of  individual  performance  across 
multiple  carousels  to  gain  insights  into  the  likelihood  of  role  playing  and  behavioral 
stereotypy.  Synchronous  tagging  on  multiple  animals  combined  with  prey  mapping  would 
allow  an  investigation  of  how  killer  whales  both  distribute  themselves  in  3D  space  to  contain 
the  herring  ball  and  respond  as  a  group  once  the  herring  manage  an  escape  and  the  ball 
dissipates.  It  would  allow  further  exploration  of  whether  turn  taking  or  synchrony  of  group 
members  is  occurring.  A  comparative  tagging  study  of  bubble  net  feeding  humpback  whales 
and  carousel  feeding  killer  whales  could  offer  insights  into  the  elements  of  convergence 
between  these  two  animal  systems  that  forage  in  groups  on  herring.  It  would  highlight  both 


80 


the  unique  strategies  deployed  by  each  species  and  the  constraints  imposed  on  the  collective 
by  certain  biological  and  physical  aspects  of  the  feeding  paradigm. 
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Chapter  3.  Everything  but  the  Cetacean  Synch:  An  Example 
Driven  Discussion  of  Behavioral  Association  in  Marine 
Mammals 

3.1  Manuscript 

A  relationship  can  be  defined  as  a  sequence  of  interactions  between  a  pair  of 
individuals  that  recognize  one  another  (Hinde,  1976;  1979).  They  are  characterized  by  the 
character  and  patterning  (i.e.,  the  timing  and  rates)  of  these  interactions  and  rely  on  the 
participation  of  both  partners  (Hinde,  1976).  It  has  become  common  in  animal  behavior  to 
define  an  association  index  as  a  helpful  tool  that  uses  the  frequency  of  specific  interactions 
and  behaviors  to  indicate  the  strength  of  a  social  relationship.  Indeed,  animals  that  interact 
rarely  do  not  generally  have  much  of  a  relationship  but  those  that  interact  more  frequently 
are  predicted  to  share  a  strong  relationship. 

In  the  marine  mammal  literature,  association  is  defined  operationally  as  the  sighting 
of  two  or  more  individuals  at  the  same  time  and  in  the  same  place.  Field  work  and  logistical 
restrictions  aside,  association  parameters  are  ideally  grounded  in  the  biology  of  the  study 
animal  and  the  behaviors  that  are  used  to  maintain  social  relationships.  Nevertheless,  the 
spatial  and  temporal  scales  of  these  association  patterns  vary  dramatically  depending  on  the 
species,  the  behavioral  context  and  the  research  study  itself.  The  differences  in  the 
quantification  of  the  spatial  and  temporal  extent  of  marine  mammal  groups  illustrate  how 
observer-biased  definitions  can  lead  to  highly  variable  interpretations  of  the  same  social 
association  data. 

Ballance  (1990),  for  example,  assigned  bottlenose  dolphins  ( Tursiops  truncatus)  to  the 
same  social  group  if  they  were  sighted  simultaneously  from  the  research  vessel,  allowing 
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separation  distances  to  range  from  a  couple  of  body  lengths  to  a  few  kilometers.  In  studies 
of  this  same  species  in  Shark  Bay,  Australia,  however,  a  10m  chain  rule  was  adopted  to 
define  a  group  as  all  individuals  within  10m  of  one  another  (Smolker  et  ah,  1992;  1993; 

Mann,  2000).  Both  approaches  reported  that  despite  allowing  the  possibility  of  groups  to 
contain  animals  separated  by  large  distances,  associating  individuals  generally  surfaced  only 
within  a  couple  of  meters  of  each  other.  Relevant  spatial  scales  should  depend  on  the 
sensory  modality  and  sensitivity  of  the  organism  being  studied.  Botdenose  dolphin  whisdes 
propagate  for  much  shorter  distances  (i.e.,  kilometers:  Janik,  2000)  than  the  calls  of  certain 
species  of  baleen  whales  (i.e.,  hundreds  of  kilometers),  for  instance,  which  dramadcally 
influences  the  range  over  which  social  coordinadon  can  operate.  Visual  and  tacdle  cues 
funcdon  across  even  fighter  spadal  distances. 

Various  time  restrictions  have  been  applied  to  determine  grouping  patterns  in  marine 
mammals  as  well.  Some  associations  have  been  scored  by  the  concurrent  appearance  of 
multiple  surfacing  individuals  within  single  or  adjacent  photographic  frames  (e.g.,  botdenose 
dolphins:  Ballance,  1990;  killer  whales  ( Orcinus  orca ):  Bigg  et  al.,  1990).  This  imposes  a 
narrow  time  window  on  the  order  of  a  couple  of  seconds  in  which  animals  must  synchronize 
their  breathing  to  be  considered  associated.  (Photographs  grouped  more  broadly  according 
to  encounter  do  not  provide  sufficient  data  to  describe  or  infer  diving  synchrony 
information.)  As  Wiirsig  (1978)  discussed,  instances  of  non-synchronous  surfacing  may 
occur  for  both  associated  and  unassociated  individuals.  Other  experimental  paradigms  have 
relaxed  these  time  constraints,  assigning  all  dolphin  individuals  passing  by  an  observational 
lookout  in  a  single  day  to  the  same  group  (Wiirsig  &  Wiirsig,  1977;  Wiirsig,  1978). 

Timing  often  depends  on  the  behavioral  context  as  well,  however.  In  certain 
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contexts,  animals  may  function  well  as  a  group  with  more  synchronized  patterns  of 
movement  and  breathing.  Synchrony  can  confer  certain  advantages  including  predator 
avoidance  and  detection  (Pulliam,  1973),  information  storage,  and  smarter  decision-making 
(Parrish  &  Edelstein-Keshet,  1999).  In  particular,  synchronous  surfacing  to  breathe  was 
thought  to  decrease  the  probability  of  predation  upon  certain  air-breathing  fishes  (Kramer  & 
Graham,  1976;  Chapman  &  Chapman,  1994).  In  contrast,  groups  of  wild  ostriches  ( Struthio 
camelus)  lessened  their  predation  risk  by  desynchronizing 

their  vigilance  behavior  (i.e.,  time  spent  with  head  up,  Bertram,  1980).  Individual  vigilance 
decreased  as  a  function  of  group  size  and  allowed  the  animals  to  use  information  gathered 
from  the  environment  to  improve  their  safety. 

But  synchrony  can  also  reflect  a  competitive  agenda.  The  function  of  synchronizing 
vocal  output,  for  example,  ranges  from  social  affiliation  at  one  extreme  (e.g.,  vocal 
convergence  in  pair  bonding  budgerigars  (Melopsittacus  undulatus),  Hile  et  al.,  2000)  to 
competition  for  reproductive  access  at  the  other  (e.g.,  vocal  contests  in  black-capped 
chickadees  (Poecile  atricapillus),  Mennill  &  Ratcliffe,  2004).  Coordinated  behaviors  may  also 
require  individuals  sharing  social  relationships  to  act  in  ways  that  are  highlv  non- 
synchronized  in  space  or  time.  Indeed,  Whitehead  (1996)  documented  that  the  diving 
behavior  of  groups  of  sperm  whales  {Physeter  macrocephalus)  near  the  Galapagos  Islands  were 
more  asynchronous  when  calves  were  present  compared  to  groups  without  calves, 
suggesting  coordination  of  alloparental  care  of  calves  at  the  surface.  Desynchronizing  dives 
increased  the  amount  of  time  that  at  least  one  adult  was  near  the  calf. 

Whitehead  (1996)  and  Whitehead  &  Weilgart  (1991)  pointed  out  that  sperm  whales 
tended  to  have  two  basic  behavioral  contexts:  socializing  while  at  the  surface  and  foraging  at 
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depth.  Other  factors  influencing  behavioral  context  may  be  social,  including  affiliative  or 
aggressive  behaviors  (e.g.,  roving  males  during  the  breeding  season),  biological  (e.g.,  changes 
in  the  depth  of  the  prey  layer  or  to  allow  the  metabolism  of  lactic  acid  accumulated  from 
deep  diving),  or  physical,  including  oceanographic  properties  like  water  temperature. 

Indeed,  sperm  whales  off  the  Galapagos  tended  to  aggregate  at  the  surface  in  the  afternoons, 
perhaps  in  part  to  take  advantage  of  shallower  waters  warmed  by  the  sun  (Whitehead  & 
Weilgart,  1991). 

Synchrony  among  marine  mammals,  especially  related  to  breathing  at  the  surface,  has 
been  considered  an  indicator  of  social  relationship  (Fripp  et  al.,  2005)  and  alliance 
membership  (Connor  et  al.,  2006).  Social  association  has  also  been  defined  in  terms  of 
nearest  neighbor  identification.  Association  indexes  that  employ  surfacing  behaviors  (see 
Mann,  2000  for  a  review)  are  biased  in  2D  since  animals  spend  most  of  their  time  below  the 
surface,  are  free  to  orient  in  any  direction  underwater,  and  may  converge  or  spread  apart  at 
depth.  Here,  we  illustrate  the  differences  between  and  implications  of  social  association 
among  free-ranging  Norwegian  killer  whales. 

Digital  archival  tags  (DTAGs)  that  contained  hydrophones  for  sampling  sound  and 
an  accelerometer,  magnetometer  and  pressure  sensor  for  sampling  3D  movement  were 
attached  to  each  animal  (Johnson  &  Tyack,  2003).  A  VHF  beacon  allowed  tracking  of  the 
tagged  animal  for  the  duration  of  the  attachment.  After  retrieving  the  released  tag,  the  time- 
stamped  movement  data  were  calibrated  to  the  whale’s  frame  of  reference  (Johnson  & 
Tyack,  2003).  See  Table  3.1  for  the  tagging  details  associated  with  the  experiment.  In 
particular,  tags  were  deployed  simultaneously  on  two  pairs  of  killer  whales  each  belonging  to 
the  same  social  group  (i.e.,  4  animals  in  all).  The  movement  data  for  the  concurrently  tagged 
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Number  of  animals 

2  pairs 

17  Nov  2005: 

Date  and  animal  id 

oo05_321a  &  oo05_32 1  b; 

18  Nov  2005: 
oo()5 322a  &  oo()5 322b 

Study  sire 

Tys fjord,  Norway 

Attachment  mechanism 

carbon  fiber  hand  pole 

Tagout  duration  (h) 

4.2;  2.1;  3.6;  3.1 

Sampling  rate:  movement  sensors  (Hz) 

50 

Additional  visual  data 

annotated  rough 
behavioral  state  of  group 

Time  interval  threshold  ' 

3s 

Depth  threshold  (m)  * 

1 

Table  3.1.  Tagging  and  analysis  details.  Tagout  durations  proceed  in  the  same  order  as  the  listing  of  animal 
identification  labels.  The  last  two  rows  marked  by  asterisks  contain  inter-surfacing  time  interval  threshold 
between  individuals  and  the  depth  threshold  used  to  determine  whether  animals  surfaced  together. 


animals  were  aligned  as  closely  as  possible  using  file  rag  rime  sramps  rhat  were  each  ser  ro 
GPS  rime  before  even-  deployment  The  half-weighr  index  (HWT:  Cairns  &  Schwager,  1987; 
Briiger  er  al.,  1994)  was  used  ro  compute  coefficients  of  association  between  the  pairs  or 
trios  of  whales  during  the  rime  ranges  depicted  in  Figure  3.1.  In  rhe  equation: 

HWT  =  2i\r/(//[  +  nj, 

2N  is  rhe  total  number  of  joint  sightings  counted  once  for  every  appearance  of  the  two 
individuals  together,  n{  is  rhe  total  number  of  sightings  for  one  individual  and  //,  is  rhe  total 
tor  rhe  other  individual.  The  surfacings  of  two  individuals  were  considered  coincident  if 
they  occurred  within  a  set  time  interval  and  a  new  surfacing  was  only  counted  after  rhe 
animals  had  each  dived  beyond  a  set  depth  threshold.  These  thresholds  are  listed  in  Table 
3.1. 

Norwegian  killer  whales  travel  in  matrilineal  groups  and  forage  coopcrativelv  on 
herring  in  a  group  behavior  called  carousel  feeding  (Simila  &  Ugarte,  1993;  Simila,  1997; 
Domenici  et  al.,  2000;  Simon  et  al.,  2005).  In  a  series  of  coordinated  maneuvers,  they  corral 
the  fish  into  a  ball  at  the  surface,  tail  slap  to  incapacitate  the  fish  and  eat  them  one  by  one. 
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a:  killer  whale  double  tagout:  traveling 


b:  killer  whale  double  tagout:  carousel  feeding 


clock  time  (local) 


Figure  3.1.  Pairs  of  dive  profiles  from  simultaneously  tagged  Norwegian  orcas  belonging  to  the  same  social 
group.  Top:  Two  animals  traveling  at  opposite  ends  of  a  coherent  group,  17  November  2005.  Bottom:  Two 
killer  whales  carousel  feeding,  18  November  2005. 


The  sequence  of  traveling  and  foraging  behaviors  is  sufficiently  well  defined  in  killer  whales 
to  allow  us  to  ask  whether  the  degree  of  synchrony  or  coordination  displayed  by  a  group  of 
related  animals  can  be  viewed  as  a  stable  indicator  of  their  social  relationship  or  whether 
synchrony  may  vary  as  a  function  of  the  requirements  of  different  behavioral  states.  The 
dive  profiles  of  two  pairs  of  killer  whales  tagged  within  two  social  matrilineal  groups  were 
compared.  On  17  November  2005,  a  small  juvenile  whale  (oo05_321a)  and  an  adult  female 
(oo05_321b)  were  tagged  in  a  group  of  ~16  killer  whales.  At  the  end  of  the  follow,  .ill  of  the 
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animals  in  the  group  were  traveling  together  steadily  to  the  northwest.  The  tagged 
individuals  swam  at  opposite  ends  of  this  group  for  a  portion  of  the  time  that  they  were 
simultaneously  tagged  (102  minutes).  Figure  3.1a  plots  the  depth  profiles  of  these  two 
animals  during  30  minutes  of  this  traveling  period  (HW1  =  0.19).  These  animals  surfaced 
and  dove  in  a  fairly  synchronized  fashion.  The  deep  pordon  of  their  dives  tended  to  diverge 
more  (i.e.,  oo05_321b  tended  to  dive  shallower  than  oo05_321a)  than  their  more 
overlapping  descents  and  ascents.  Although  their  surfacing  and  diving  svnchrony  might 
suggest  close  social  association,  they  were  never  each  other’s  nearest  neighbor  when 
observed  at  the  surface.  Therefore,  the  use  of  the  nearest  neighbor  metric  of  association 
would  have  missed  the  3D  coordination  and  interactions  of  these  groups.  Nearest  neighbor 
data  may  be  useful,  however,  when  discerning  the  relative  importance  of  individual 
relationships  among  more  fluid  group  structures  such  as  those  found  in  bottlenose  dolphin 
communities. 

On  18  November  2005,  an  adult  female  (oo05_322a)  and  adult  male  (oo05_322b) 
were  tagged  in  a  group  of  ~15  carousel  feeding  animals.  The  tags  recorded  data 
simultaneously  for  140  minutes;  30  minutes  of  their  overlapping  dive  profiles  are  presented 
in  Figure  3.1b  (HWI  =  0.1 1).  In  contrast  to  oo05_321a  and  oo05_321b,  the  foraging  efforts 
of  oo05_322a  and  oo05_322b  did  not  result  in  consistently  synchronized  diving,  even  as 
they  used  tail  slaps  to  feed  on  the  herring  ball.  We  therefore  observed  highly  synchronous 
diving  behavior  by  the  two  animals  that  were  traveling  steadily  (Figure  3.1a),  which  differed 
from  the  asynchronous  diving  by  the  two  individuals  that  were  coordinating  with  their  group 
to  carousel  feed  (Figure  3.1b).  The  HWI  was  higher  for  the  two  traveling  animals  than  the 
two  carousel  feeding  whales.  The  shallower  depth  and  shorter  time  interval  thresholds  used 
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for  the  four  killer  whales  contributed  to  their  low  HWI  scores. 


While  the  differences  in  synchrony  and  HWI  of  the  two  pairs  of  tagged  killer  whales 
may  have  been  a  function  of  the  individual  differences  in  the  strength  of  social  relationship 
among  the  tagged  whales,  differences  in  the  behavioral  state  (i.e.,  traveling  versus  carousel 
feeding)  could  just  as  likely  be  the  primary  influence.  To  maintain  contact  and  cohesion 
during  travel,  it  is  likely  that  synchronized  vertical  movement  through  the  water  column  was 
useful  to  the  group  since  vocal  acdvity  was  largely  absent.  The  similar  breathing  frequencies 
of  oo05_321a  and  oo05_321b  at  the  surface  (assuming  that  every  depth  zero-crossing 
corresponds  to  a  single  breath)  also  implies  comparable  oxygen  demands  due  to  diving  to 
similar  depths. 

The  movement  and  behavioral  demands  of  carousel  feeding  on  a  group  are  different 
than  traveling:  the  animals  must  continuously  and  dynamically  preserve  the  integrity  of  a  ball 
of  herring  while  individual  killer  whales  tail  slap  and  consume  fish  from  its  periphery.  The 
dive  records  in  Figure  3.1b  reveal  certain  portions  that  were  out  of  phase  with  each  other 
and  fewer  portions  that  were  in  phase.  If  members  of  a  group  were  to  dive  synchronously 
all  of  the  time,  the  fish  might  be  able  to  take  advantage  of  the  absence  of  whales  at  a 
particular  depth  to  escape.  Instead,  the  orcas  appear  to  work  different  depths  to  their 
advantage,  trapping  the  herring  against  the  surface  and  preventing  their  lateral  dispersal  by 
taking  turns  at  depth.  Breathing  requires  an  orca  to  position  itself  at  the  surface  because  if 
all  group  members  took  a  breath  synchronously,  the  fish  would  have  a  greater  opportunity  to 
escape. 

It  is  important  to  exercise  caution  when  making  inferences  about  the  social 
implications  of  behavioral  synchrony.  For  example,  one  might  assume  that  individuals 
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oo05_321a  and  oo05_321b  maintain  a  closer  social  bond  than  oo05_322a  and  oo05_322b 
based  on  a  higher  HWI  and  the  extent  to  which  they  synchronize  their  diving.  It  is  worth 
considering  the  possibility  that  the  behavioral  regime  and  context  may  be  the  principal  factor 
affecting  the  synchrony  of  diving  patterns  of  these  three  whale  species.  Evidence  of 
synchrony  may  indicate  individual  or  group  relationship  but  an  absence  of  synchrony  does 
not  demand  an  absence  of  social  relationship:  it  could  suggest  that  a  different  kind  of 
behavioral  coordinadon  is  present.  Indeed,  HWI  alone  reflects  a  2D  temporal  relationship 
that  does  not  necessarily  depict  the  extent  to  which  two  animals  may  be  coordinating  their 
behaviors  spatially  and  associating  in  3D. 

Figure  3.2  plots  HWI  as  a  function  of  the  time  interval  considered  to  constitute  a 
synchronous  surfacing  between  animals.  Naturally,  as  the  time  interval  grew  and  additional 
surfacings  separated  by  longer  periods  of  time  were  considered  synchronous,  the  HWI 
increased  incrementally.  The  magnitude  of  the  change  in  this  step  size  depended  on  the  total 
number  of  dives  considered  in  the  calculation  and  the  spread  of  the  inter-surfacing  intervals. 
Killer  whales  oo05_322a  and  oo05_322b,  for  example,  surfaced  numerous  times  with  a  wide 
range  of  inter-surfacing  intervals  and  therefore  displayed  short  step  lengths  in  Figure  3.2. 

This  figure  shows  that  HWI  in  this  dataset  is  sensitive  to  the  time  interval  selected  and  that 
certain  diving  regimes  may  be  more  tolerant  to  flexibility  in  selecting  this  interval.  It  is  useful 
to  examine  such  a  plot  to  ascertain  the  stability  of  the  HWI  value. 

The  behavior-dependent  diving  synchrony  or  asynchrony  for  the  species  described 
here  demonstrated  that  social  bonds  can  be  expressed  in  a  variety7  of  ways.  Animals  that 
cluster  at  the  surface  may  associate  affiliatively  in  3D  or  they  may  disband  underwater  and 
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- Orcinus  pair:  321  a,b 

- Orcinus  pair:  322a, b 


Figure  3.2.  Half-weight  index  (HWI)  as  a  function  of  the  time  interval  considered  to  be  a  synchronous 
surfacing  for  each  pairing  of  animals  in  Figure  3.1. 


function  either  independently  or  cooperatively.  Animals  that  do  not  cluster  at  the  surface 
may  still  coordinate  their  independent  behavior  through  vocal  communication  or  turn  taking 
(e.g.,  Whitehead,  1996).  Visual  association  indexes  may  have  made  the  best  of  incomplete 
observations  of  marine  mammals  but  we  are  no  longer  limited  to  these  opportunistic 
sightings  of  surfacing  individuals.  New  electronic  data  collection  tools  like  the  DTAG  are 
yielding  movement  and  behavioral  data  that  permit  direct  measurement  of  vocal 
communication  and  continuous  social  association  indexes.  These  metrics  are  more  relevant 
to  the  animals,  rendering  realistic  aspects  of  their  socially  complex  behaviors  accessible  to 
study. 
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Tagging  multiple  animals  will  permit  an  understanding  of  whether  certain  behavioral 
contexts  better  predict  synchrony  of  movement.  Tagging  studies  can  be  combined  more 
with  traditional  photographic  methods  of  computing  association  indexes,  which  would  help 
determine  whether  these  two  techniques  can  function  independently.  It  is  likely,  however, 
that  the  two  will  inform  one  another  as  the  photography  can  complement  the  tag  data, 
offering  nearest  neighbor  information  and  helping  subsequently  to  synchronize  the  timing  of 
the  multiple  tag  records.  In  addition,  it  might  be  possible  to  connect  certain  surfacing 
behaviors  with  corresponding  3D  association  patterns  and  vice  versa.  When  designing  these 
studies,  it  is  important  to  consider  1)  the  social  and  behavioral  implications  of  the  presence 
or  absence  of  synchrony,  association  and  coordination;  2)  the  possibilitv  that  behavioral 
coordination  may  or  may  not  lead  to  synchronization;  and  3)  the  ways  in  which  individual  or 
groups  of  marine  mammals  engage  in  simultaneous  behavior  either  as  a  result  of 
encountering  and  reacting  to  a  common  stimulus  or  to  achieve  a  shared  goal.  These 
elements  will  all  help  to  provide  information  on  the  strength  of  social  relationships  between 
individuals  within  a  population. 
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Chapter  4.  Relationship  between  Movement  and  Vocal  Behavior 
of  Free-Ranging  Norwegian  Killer  Whales 

4.1  Abstract 

Though  the  rates  of  killer  whale  pulsed  calls  fluctuate  according  to  behavior  state, 
the  individual  call  types  are  generally  considered  to  be  functionally  interchangeable.  1  test 
this  assertion  using  individual  movement  data  and  group  calling  sequences  recorded  with 
digital  archival  tags  attached  to  free-ranging  Norwegian  killer  whales.  Twelve  animals  were 
tagged  and  seven  of  these  engaged  in  carousel  feeding,  a  vocally  active  time  when  herring 
were  trapped  at  the  surface,  tail  slapped  and  eaten.  On  4  occasions,  carousel  feeding  was 
preceded  by  a  vocally  active  period  of  putative  corralling  involving  broad  looping 
presumably  to  locate  herring  at  depth.  In  contrast,  in  2  of  the  3  instances  in  which 
carouseling  was  anucipated  by  direcdonal  travel,  the  animals  were  silent,  suggesdng  that  thev 
may  have  eavesdropped  to  locate  conspecific  groups  that  were  already  feeding  on  herring  at 
the  surface.  The  recordings  were  then  divided  into  two  general  behavioral  states:  tail 
slapping  (TS)  periods  that  coincided  with  carousel  feeding  activity  and  periods  with  no  tail 
slapping  (NTS).  1  predicted  that  killer  whales  depended  on  orientation  cues  more  during 
carousel  feeding  than  other  behaviors.  The  relative  differences  in  level  between  the  low  and 
high  frequency  components  of  pulsed  calls  may  provide  such  an  orientation  cue  of  the 
signaler  to  the  receiver.  My  prediction  was  confirmed  using  a  rotation  test  that  preserved  the 
serial  dependence  of  the  original  data:  more  calls  characterized  by  both  components  than  the 
low  component  alone  were  produced  during  TS  than  NTS  episodes  in  5  of  the  7  whales. 
These  results  were  consistent  with  the  top  three  call  types  contributing  to  the  significant 
differences  in  the  rates  of  call  type  production  between  TS  and  NTS  periods  in  all  but  one  of 
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the  whales.  Collectively,  these  results  are  consistent  with  the  premise  that  Norwegian  killer 
whales  use  their  vocalizations  to  provide  information  to  conspecifics  about  their  3D  position 
and  orientadon  as  they  corral  and  feed  in  groups.  No  significant  reladonship  was  found 
between  call  type  and  the  actual  measurements  from  the  movement  sensors.  Addidonal  data 
collected  across  multiple  days  and  carousels  are  required  to  explore  the  possibility  of  more 
detailed  relationships  between  movement  and  vocal  behavior. 

4.2  Introduction 

A  classical  ethological  approach  suggested  that  much  of  animal  communication 
involves  a  linked  system  of  inherited  motor  patterns  that  generate  signals  with  specific 
functions  coupled  with  sensory  mechanisms  in  the  receiver  to  detect,  classify  and  respond 
appropriately  to  the  signal.  Following  this  approach,  many  ethologists  attempt  to  link  single 
vocal  categories  to  specific  behavioral  contexts  of  the  signaler  and  specific  behavioral 
responses  of  conspecifics  within  earshot.  Some  marine  mammal  studies  report  strong  links 
between  signaling,  context  and  response.  In  southeast  Alaska,  groups  of  humpback  whales 
( Megaptera  novaeangliae)  produce  a  series  of  cries  that  crescendo  to  one  especially  loud  cry  that 
immediately  anticipates  their  coordinated  surfacing  and  lunge  feeding  on  herring  ( Clupea 
harengus)  (D’Vincent  et  al.,  1985;  Cerchio  &  Dahlheim,  2001).  It  is  possible  that  the 
humpback  whales  use  these  cries  to  synchronize  the  endgame  of  their  foraging  sequence. 
Icelandic  killer  whales  ( Orcinus  orca)  likely  herd  herring  using  a  loud  136  pulsed  call  (Simon  et 
al.,  2006).  The  fundamental  frequency  of  this  call  resembles  the  resonant  frequency  of  the 
herring  swim  bladder  and  the  frequency  of  maximum  hearing  sensitivity  in  herring.  Thus, 
whereas  the  humpbacks  appeared  to  use  their  cries  to  coordinate  the  behavior  of 
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conspecifics,  the  killer  whales  seemed  to  exploit  the  physical  properties  of  their  call  to  handle 
their  prey  more  effecdvely. 

Because  of  the  traditional  difficulties  associated  with  acquiring  continuous  behavioral 
observations  of  individual  marine  mammals,  the  functional  links  between  their  behaviors 
and  many  of  their  vocalizations  were,  until  recently,  generally  not  as  apparent.  One  common 
analytical  approach  involved  dividing  the  behavioral  sequence  into  a  series  of  states  and 
determining  whether  these  states  were  characterized  by  a  reliable  subset  of  vocal  classes  (e.g., 
Atlantic  spotted  dolphins  (Stenella frontalis)  and  bottlenose  dolphins  (Turszops  truncatus), 
Herzing,  1996).  Links  between  the  function  and  context  of  vocal  behavior  have  improved 
substantially  using  digital  archival  tags  that  simultaneously  record  movement  and  acoustic 
data.  Northeast  Pacific  blue  whales  ( Balaenoptera  musculus),  for  example,  produced  singular  D 
calls  frequently  and  singular  B  calls  occasionally  during  davtime  foraging  dives,  which 
suggested  a  function  relating  to  feeding  (Oleson  et  al.,  2007a;  2007b).  Blue  whale  song, 
however,  was  produced  at  dawn  and  dusk  exclusively  by  males,  implving  a  reproductive 
purpose. 

Extensive  evidence  of  vocal  production  learning  among  marine  mammals  raises 
questions  about  the  interpretation  of  communication  in  these  species.  During  the  1960s, 
ethologists  recognized  that  oscine  songbirds  do  not  inherit  the  motor  patterns  that  generate 
their  songs,  but  rather  need  to  hear  songs  to  produce  the  typical  songs  of  their  population. 
These  birds  have  evolved  neural  mechanisms  that  enable  more  flexibility  in  vocal 
development  in  which  communication  signals  produced  by  males  can  be  influenced  by  both 
what  the  male  hears  and  feedback  from  females. 

As  has  been  demonstrated  for  many  reproductive  advertisement  displays,  sexual 
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selection  can  lead  to  the  elaboration  of  complex  displays.  One  form  of  complexity  of  bird 
song  involves  song  repertoires  where  one  male  may  sing  many  different  songs.  Females  of 
some  species  prefer  males  with  larger  song  repertoires  and  in  these  species  selection  favors 
males  with  the  capacity  to  remember  and  produce  many  songs.  Early  reports  of  song 
repertoires  assumed  that  songs  were  interchangeable  and  funcdonaUy  equivalent  (see  Krebs 
&  Kroodsma,  1980;  McGregor,  1991).  Subsequent  investigation  of  communicadon  between 
males,  however,  uncovered  that  males  may  select  specific  songs  from  their  repertoires  to 
modulate  aggressive  or  territorial  interacuons.  If  a  territorial  male  matches  the  song  of  a 
neighbor,  this  is  likely  to  escalate  the  fight  On  the  other  hand,  if  a  male  responds  to  a 
neighbor  with  a  different  song,  but  one  that  these  two  birds  shared  in  common  (male  song 
sparrows  ( Melospi^a  melodia ),  Beecher  et  al.,  1996),  this  “repertoire  matching”  can  help  to 
prevent  the  interaction  from  escalating  into  a  fight. 

Like  songbirds,  killer  whales  have  been  reported  to  imitate  sounds  (Bowles  et  al., 
1988;  Foote  et  al.,  2006)  and  members  of  each  killer  whale  group  produce  a  repertoire  of 
shared  calls.  Most  research  on  these  stereotyped  calls  has  largely  concluded  that  the 
different  calls  in  the  repertoire  are  contextually  and  functionally  equivalent.  Ford  (1989) 
found  no  evidence  that  different  call  types  were  produced  reliably  according  to  behavioral 
state  by  resident  fish-eating  killer  whales  in  the  Pacific  Northwest.  The  overall  rate  of 
vocalization  of  killer  whales  may  vary  by  behavioral  context.  Vocalization  rates  tended  to 
climb  during  periods  of  high  arousal  (Bain,  1986),  whereas  calling  activity  generally  fell  to 
low  levels  or  zero  during  periods  of  resting  (Hoelzel  &  Osborne,  1986;  Ford,  1989). 
Transient  killer  whales  must  additionally  remain  silent  as  they  forage  to  prevent  alerting  their 
marine  mammal  prey  whose  hearing  is  sensitive  in  the  frequency  range  of  pulsed  calls 
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(Deecke  et  al.,  2002).  During  the  other  behaviors  of  surface  activity,  slow  travel  and  milling 
after  a  kill,  no  difference  in  call  type  usage  by  the  transients  was  observed  (Deecke  et  al., 
2005). 

There  is  some  evidence  for  differential  usage  of  call  types  in  different  contexts. 

Some  call  types  may  be  used  more  frequency  during  intra-pod  meedngs  (Miller  &  Bain, 
2000).  Van  Opzeeland  et  al.  (2005)  explored  the  vocal  behavior  of  Norwegian  killer  whales 
during  two  types  of  foraging.  Seiner  feeding  involved  orcas  feeding  on  herring  discarded 
from  fishing  boats.  Carousel  feeding,  by  contrast,  referred  to  the  efforts  of  a  group  of 
whales  that  corralled  herring  from  depth,  trapped  them  in  a  dght  ball  against  the  surface  and 
tail  slapped  the  edge  of  the  ball  to  stun  the  fish  before  eadng  them  one  by  one  (Christensen, 
1978;  Simila  &  Ugarte,  1993).  N21  was  the  only  call  type  that  showed  stadsdcally-significant 
differendal  usage  as  it  was  produced  more  frequendy  during  seiner  than  carousel  feeding 
(Van  Opzeeland  et  al.,  2005).  The  sample  sizes  were  somewhat  unbalanced,  however,  since 
only  2  seiner  feeding  events  were  observed  reladve  to  16  carousel  feeding  episodes.  Another 
potendal  confounding  factor  of  this  analysis  was  the  difference  in  call  repertoires  between 
pods.  Each  Norwegian  killer  whale  pod  vocalizes  3  to  16  call  types  (Strager,  1995).  These 
repertoires  can  overlap  but  it  is  possible  that  the  groups  might  produce  shared  call  types  in 
different  contexts  or  propordons. 

Killer  whale  pulsed  calls  are  often  characterized  by  two  simultaneously-produced,  vet 
independendv-modulated  components:  a  low  frequency  component  (LFC)  occurring 
between  80  Hz  and  2.4  kHz  (Ford,  1987)  and  a  high  frequency  component  (HFC)  ranging 
from  2  to  12  kHz  (Hoelzel  &  Osborne,  1986).  These  components  differ  in  terms  of  their 
directionality  as  well.  The  HFC  is  beamed  forward  from  the  melon  of  the  animal  whereas 
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the  LFC  is  less  directional  (Miller,  2002).  When  Miller  (2002)  recorded  directly  in  front  of  a 
vocalizing  animal  in  the  wild,  the  calls  contained  roughly  equal  levels  of  LFC  and  HFC 
energy.  When  recording  from  behind  the  animal,  however,  the  HFC  was  considerably  softer 
or  absent  altogether.  Combined  with  other  spectral  characteristics  (Miller  et  al.,  2007),  the 
relative  proportion  of  energy  between  the  LFC  and  HFC  may  allow  the  signaler  to  provide 
conspecifics  with  orientation  cues.  In  addition,  the  peak  energy  ratio  between  the  first  and 
second  harmonics  of  the  LFC  was  significantly  greater  for  calls  vocalized  by  adult  female 
versus  adult  male  killer  whales  (Miller  et  al.,  2007).  These  somewhat  subtle  details  of  pulsed 
calls  may  constitute  salient  elements  to  which  conspecifics  attend  and  might  be  related  to  the 
use  of  certain  call  types  over  others  in  different  behavioral  contexts. 

Much  of  the  earlier  research  has  supported  a  null  hypothesis  that  killer  whale  call 
types  are  interchangeably  produced  independent  of  behavioral  context.  Analyses 
incorporating  the  categories  of  call  types  being  produced  and  more  detailed  representations 
of  the  movements  of  the  animals,  however,  may  help  reveal  the  relationships  between  vocal 
and  non-vocal  behavior.  Using  data  gathered  from  digital  archival  tags,  this  chapter  explores 
the  possible  relationships  between  the  individual  movements  and  group  vocal  behavior  of 
free-ranging  Norwegian  killer  whales.  I  specifically  query  whether  call  types  are 
interchangeable  at  both  coarse  and  fine  time  and  categorization  scales. 

Killer  whale  call  types  do  not  appear  to  be  generated  as  a  string  of  independent 
events.  Often,  for  example,  call  types  tend  to  be  repeated  in  series  (Ford,  1989;  Miller  et  al., 
2004c).  One  of  the  primary  methodological  difficulties  is  dealing  with  the  serial  dependence 
of  these  sequences,  or  the  tendency  for  the  occurrence  of  a  call  type  to  depend  on  the 
preceding  call  type.  Because  serial  dependence  renders  invalid  standard  statistical  tests  that 
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treat  each  call  as  an  independent  event,  I  instead  took  advantage  of  non-parametric  rotadon 
tests  and  parametric  bootstraps  using  fitted  Markov  chain  models 

At  first,  this  analysis  parallels  earlier  studies  by  dividing  the  dataset  into  behavioral 
states.  The  tail  slapping  and  not  tail  slapping  states  were  defined  based  on  the  acousdc  and 
movement  signatures  of  tail  slaps  identified  in  Chapter  2.  It  considers  broad  divisions  of 
calls  according  to  spectral  characteristics  as  well  as  a  more  specific  treatment  of  the 
individual  call  types.  Secondly,  I  take  advantage  of  the  detailed  movement  sensor  data  to  test 
whether  call  type  can  serve  as  a  reliable  predictor  of  movement  features  and/or  whether 
particular  movement  data  regimes  can  anticipate  certain  call  types  dependablv. 

4.3  General  materials  and  methods 

Field  work  was  conducted  aboard  a  sailing  vessel  and  rigid  hull  inflatable  boat  in 
November  2005  and  2006  in  Tvsfjord  and  Vestf|ord  in  northern  Norway.  Digital  archival 
tags  were  attached  to  14  free-ranging  killer  whales  using  a  handheld  7m  carbon-fiber  pole 
(Johnson  &  Tyack,  2003).  These  tags  recorded  movement  data  at  50  Hz  and  sound  data  at 
96  kHz,  released  from  the  animal  after  a  designated  period  of  time  and  were  collected  for 
data  offload  and  subsequent  redeployment.  All  sensor  and  audio  files  were  burned  to  CD. 
The  sensor  data  were  calibrated  and  converted  to  the  whale’s  frame  of  reference  (pitch:  —90° 
=  pointing  downwards  and  +90°  =  pointing  upwards;  roll:  0°  =  dorsal  side  up  and  180°  = 
ventral  side  up;  heading:  compass  bearing).  The  pressure  sensor  provided  depth 
measurements.  Several  measurements  were  derived  from  the  calibrated  movement  data 
including  fluking  intensity  (calculated  as  the  mean  square  of  the  filtered  z-axis  of  the 
accelerometer:  Miller  et  al.,  2004b;  Hooker  et  al.,  2005;  Chapter  2)  and  derivatives  of  roll. 


101 


depth  (i.e.,  vertical  velocity)  and  poindng  angle  (a  variable  that  combined  the  pitch  and 
heading  data  to  compute  the  3D  orientation  of  the  whale’s  longitudinal  axis:  Miller  et  al., 
2004a).  A  linearity  index  (LI),  a  path-based  measure  of  the  inverse  tortuosity  of  the 
horizontal  pseudo-track,  was  calculated  according  to  the  method  described  in  Chapter  2. 
Briefly,  a  window  was  slid  along  the  record  and  for  each  secdon  of  track,  the  shortest 
distance  between  the  beginning  and  ending  points  was  divided  by  the  length  of  the  full  path 
traversed  by  the  whale  (see  Wilson  et  al.,  2007).  This  rado  was  concerned  with  the  reladve 
difference  in  scale  between  these  two  quanddes  and  not  the  actual  distance  measures,  which 
were  not  known.  The  values  ranged  from  0,  or  absolute  circuitousness,  to  1,  or  straight  line 
travel. 

The  audio  records  were  scanned  manually  and  all  acousdc  events  (i.e.,  tail  slaps,  stereotyped 
and  variable  pulsed  calls,  echolocadon  clicks,  etc.)  were  documented.  Three  naive  observers 
and  I  sorted  the  pulsed  calls  into  different  categories  by  visual  inspecdon  of  the 
spectrograms.  Call  types  were  labeled  according  to  earlier  catalogues  (Strager,  1993;  Van 
Opzeeland  et  al.,  2005)  or  given  new  numbers  if  they  had  not  been  described  previously. 

(See  Chapter  5  for  a  more  detailed  descripdon  of  assigning  calls  to  type.)  Table  4.1  lists 
which  call  types  occurred  in  each  recording  and  Table  4.2  offers  the  duradon  and  mean 
frequency  data  for  all  call  types  measured  by  the  contour  tracing  described  in  Chapter  5. 
Calling  bouts  were  defined  as  periods  of  dme  containing  at  least  10  calls  occurring  within  5 
minutes  of  one  another.  The  movement  record  of  one  of  the  whales (oo06_324s)  could  not 
be  calibrated  and  the  audio  record  of  another  one  of  the  whales  (oo05_321b)  did  not  contain 
any  vocal  acdvity.  Because  the  analysis  here  required  both  the  movement  and  vocal  data 
streams,  these  two  tagouts  were  jetdsoned  from  the  analysis,  leaving  12  whales  and 
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recordings  that  totaled  30.6  hours  of  data. 


4.4  Analysis  1:  State-dependent  models 

Previous  research  has  relied  on  observations  of  surface  activity  to  segment  killer 
whale  activity’  into  general  behavior  states  (Bain,  1986;  Hoelzel  &  Osborne,  1986;  Ford, 

1989;  Deecke  et  al.,  2005).  State-dependent  models  assume  an  ability  to  define  most  or  all  of 
the  important  phases  of  a  particular  time  course,  which  would  correspond  here  to  the 
behavioral  states  of  the  tagged  animal.  Paralleling  earlier  studies,  then,  this  section  examines 
whether  there  is  any  evidence  for  state-dependent  usage  of  calls.  I  first  examined  whether 
the  presence  or  absence  of  calling  activity  is  associated  with  particular  movement  features 
and  then  with  two  specific  feeding- related  behavioral  sequences.  Next,  I  tested  for 
ditferences  in  call  type  usage  based  on  the  functional  benefit  of  cueing  orientation  during 
carousel  feeding. 

4.4.1  Methods 

Two  behavioral  states  were  identified  based  on  the  occurrence  of  tail  slaps  in  the 
audio  record  (see  Chapter  2  for  complete  details).  Tail  slapping  (TS)  periods  contained  at 
least  3  tail  slaps  that  were  produced  within  2  minutes  of  one  another.  To  exclude  episodes 
of  tail  slapping  that  were  conducted  entirely  by  a  non-focal  group,  TS  periods  included  at 
least  one  tail  slap  that  was  produced  by  the  tagged  focal  animal.  All  other  segments  of  the 
record  were  considered  not  tail  slapping  (NTS)  penods.  Because  tail  slapping  occurred 
during  carousel  feeding,  TS  episodes  were  considered  representative  of  foraging  behavior. 
NTS  periods,  however,  likely  encompassed  a  wide  array  of  behaviors  that  excluded 


103 


Table  4.1.  Occurrence  of  call  types  in  different  recordings  (continued  onto  next  page).  Yellow  cells  indicate 
call  types  that  were  documented  in  3  or  fewer  recordings. 


call  type  316a 

8  2 

320a  320b 

9  37 

321a  321b 

12 

322a  322b 

28  23 

324a  313s 

16 

314a  314s 

4 

317s  324s 

327s 

59 

9.2 

1 

_ 27 

10 

27  30 

2 

99  :  41 

5 

3  3 

2 

1 

10J 

1 

10 

12.1  337 

449  125 

1 

32 

3 

15 

7 

4 

9  17 

1  2 

1 

7 

16.1  81 

193  98 

9 

1  1 

2  36 

2  11 

21  62 

116  43 

1 

23.2 

87  129 

2 

23.3 

4 

2 

4 

10 

26.2 

12 

1  68 

i 

2 

32  229 

247  42 

32.2  3 

2  4 

45  46 

64  35 

2 

11  7 

13  4 

4 

1 

1 

64  137 

150  27 

3 

2  19 

2 

65 

11  14 

66  245 

99  29 

16  1 

66.2  2 

1  2 

67  189 

70  18 

3 

1  13 

2 

68 

5  16 

69  43 

63  8 

2 

3 

69.2  2 

71 

3  5 

72 

1  20 

2 

325  246 

2  64 

2 

17 

72.2 

3  37 

9 

301  242 

153  242 

3  1 

132 

72.3 

1  6 

71  37 

14  65 

40 

73 

3  37 

377  221 

55  3 

116  282 

12  4 

103 

73.3 

4 

74 

9 

1 

75 

11 

76  96 

44  2 

1 

3 

1 

76.2 

43 

77 

9 

88  67 

24 

168  36 

10  8 

_ 72 

77.2 

6 

8  10 

1 

1  1 

7 

77.3 

22 

78 

103  120 

79 

28 

80 

95  150 

9 

4 

5 

81 

2 

76 

15  128 

2 

1  2 

3 

813  || 

15  1 

82 

9 

10 

4 

83  1 

1  1 

3 

19 

104 


call  typ 

c  316a  320a  320b 

321a  321b  322a 

322b 

324  a  313s  314a  314s  317s  324s 

327s 

84 

19 

11 

13 

24 

85 

2 

11 

42 

29 

12  8  3 

14 

86 

1 

7 

7 

87 

1  13  5 

10 

4 

3  3 

88 

13  1 

89 

1  29 

1 

3 

3 

19  3 

90 

12 

91 

153  51  14 

91.2 

2  1  4 

91.3 

1 

1 

93 

5 

6 

94 

7 

3 

1 

95 

14 

1 

11  1 

96 

9 

98 

4 

99 

2 

too 

101 

102 

103 

1 

6 

3 

1 

1 

* 

6 

9 

l 

success 

ful  carousel  feeding. 

In  Chapter  2,  I  identified  two 

primary 

behavioral  sequences  preceding  the  tail 

slapping  and  tight  circling  that  indicated  carousel  feeding.  Both  of  these  sequences  were 
classified  as  NTS  periods.  In  the  first  sequence,  carousel  feeding  was  anticipated  by  vertical 
excursions  and  broad  horizontal  looping,  a  set  of  behaviors  that  strongly  suggested  that  the 
animals  were  corralling  and  shoaling  herring  from  depth  (Nottestad  &  Simila,  2001; 
Nottestad  et  al.,  2002).  In  the  second  sequence,  feeding  occurred  after  a  period  of 
directional  travel  that  involved  minimal  or  no  looping,  'these  two  sequences  will  be  referred 
to  as  putative  corralling  and  putative  traveling,  respectively.  I  examined  the  presence  or 
absence  of  stereotyped  calling  associated  with  each  of  these  behaviors. 

For  the  next  analysis,  call  types  were  divided  into  two  general  categories:  those 


105 


Table  4.2.  Call  type  component  counts,  durations,  mean  frequencies  (continued  onto  next  two  pages). 


duration  (s)  mean  frequency  (Hz) 


call  type 

LFC  or  HFC 

Component  number 

N 

<7 

a 

8 

LFC 

1 

76 

0.94 

0.32 

1592.27 

315.26 

8 

HFC 

1 

67 

0.95 

0.27 

7530.44 

820.30 

9.2 

LFC 

1 

19 

0.71 

0.22 

343.60 

37.13 

9.2 

LFC 

2 

19 

0.81 

0.17 

2311.35 

133.29 

10 

LFC 

1 

72 

0.62 

0.16 

696.39 

247.58 

10 

HFC 

1 

52 

0.05 

0.03 

4910.92 

406.57 

10 

HFC 

2 

52 

0.69 

0.19 

8579.36 

1541.60 

10.3 

LFC 

1 

7 

0.72 

0.06 

571.15 

75.52 

12.1 

LFC 

1 

387 

1.55 

0.34 

1617.06 

292.76 

15 

LFC 

1 

8 

0.16 

0.07 

2040.73 

1170.76 

15 

HFC 

1 

1 

0.08 

4770.55 

16.1 

LFC 

1 

109 

0.69 

0.28 

1113.11 

158.66 

16.1 

LFC 

2 

109 

0.24 

0.13 

2190.94 

304.52 

16.1 

HFC 

1 

99 

0.11 

0.08 

5447.29 

619.48 

16.1 

HFC 

2 

99 

1.03 

0.37 

7639.61 

671.67 

21 

LFC 

1 

104 

1.16 

0.23 

970.70 

80.12 

21 

HFC 

1 

82 

0.68 

0.33 

6912.27 

479.28 

23.2 

LFC 

1 

62 

0.16 

0.06 

1919.91 

196.84 

23.2 

LFC 

2 

62 

1.09 

0.16 

694.55 

50.71 

23.3 

LFC 

1 

15 

0.30 

0.09 

2091.71 

152.86 

23.3 

HFC 

1 

8 

0.34 

0.08 

8627.46 

604.28 

26.2 

LFC 

1 

69 

1.11 

0.25 

1040.48 

127.51 

32 

LFC 

1 

226 

0.45 

0.08 

2280.51 

228.25 

32 

LFC 

2 

226 

0.70 

0.12 

805.70 

69.70 

32.2 

LFC 

1 

7 

0.47 

0.11 

2011.22 

80.46 

32.2 

LFC 

2 

7 

0.36 

0.11 

879.36 

51.55 

32.2 

HFC 

1 

5 

0.66 

0.11 

8739.74 

260.58 

45 

LFC 

1 

61 

0.09 

0.05 

1533.52 

550.59 

64 

LFC 

1 

161 

0.92 

0.27 

639.56 

132.99 

64 

HFC 

1 

4 

0.18 

0.05 

5492.33 

107.90 

64 

HFC 

2 

4 

0.74 

0.30 

6284.77 

155.52 

65 

LFC 

1 

8 

0.37 

0.06 

1209.53 

48.52 

65 

HFC 

1 

6 

0.50 

0.08 

5442.28 
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containing  only  a  low  frequency  component  (LFC)  and  those  containing  both  low  and  high 
frequency  components  together  (L/HFC).  Because  orientation  cues  (see  Introduction)  may 
have  been  especially  important  as  animals  were  carousel  feeding,  I  expected  this  foraging 
behavior  to  be  characterized  by  more  L/HFC  than  LFC  calls  if  they  function  as  orientation 
cues.  To  test  this  prediction,  I  compared  the  kinds  of  call  types  produced  during  NTS 
versus  TS  periods.  Of  the  9  tag  recordings  that  contained  vocal  activity  during  NTS  and  TS 
periods  (the  remaining  3  (i.e.,  oo05_316a,  oo05_320a,  oo05_324a)  were  characterized 
exclusively  by  NTS  activity),  only  7  were  considered  here.  The  other  two  (oo06_314a  and 
oo06__317s)  included  almost  all  LFC  calls  with  only  0  or  1  L/HFC  calls. 

An  odds  ratio  was  calculated  from  the  2x2  contingency  table  containing  the  actual 
tallies  of  LFC  and  L/FIFC  calls  during  NTS  and  TS  periods.  One  concern  with  this  dataset 
was  its  serial  dependence.  In  particular,  LFC  calls  tended  to  follow  LFC  calls  and  L/I  IFC 
calls  tended  to  follow  L/HFC  calls  significantly  more  often  than  expected  by  chance  (as 
assessed  by  a  chi-square  test).  To  test  for  significance  of  the  odds  ratio  and  accommodate 
the  inherent  serial  dependence  of  the  data,  a  rotation  test  was  conducted  that  rotated  the 
calling  sequence  relative  to  the  NTS  and  TS  divisions.  The  odds  ratio  was  calculated  for 
each  of  1000  such  rotations  and  this  distribution  was  compared  to  the  actual  value  to 
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determine  a  P-value. 


In  addition,  a  pooled  result  was  tabulated  across  all  of  the  whales  except  oo05_322b. 
line  tag  records  ot  individuals  oo05_322a  and  oo05_322b  were  not  independent  because 
they  were  tagged  simultaneously.  Animal  oo05_322a  was  retained  for  the  pooled  analysis 
because  of  the  greater  power  afforded  by  the  longer  recording  duration  (3.6  hours  instead  of 
3.1  hours).  A  single  2x2  contingency  table  was  formed  by  summing  the  data  from  the  6 
whales.  The  rotation  test  was  conducted  as  above  except  the  results  of  each  rotation  were 
summed  across  the  6  whales.  The  odds  ratio  was  computed  for  the  1000  rotations  and  this 
distribution  was  compared  to  the  actual  pooled  odds  ratio  value  to  determine  the  P-value. 

'line  next  analysis  examined  differential  call  use  more  closely  by  exploring  whether 
call  types  were  produced  at  different  rates  in  different  behavioral  contexts.  A  2  x  AT 
contingency  table  was  constructed  to  tally  the  number  of  each  of  the  N  call  types  that  was 
produced  during  the  NTS  versus  TS  behavior  stares.  A  parametric  bootstrap  using  a  fitted 
Markov  chain  model  was  employed  to  gain  more  power  than  a  rotation  test  would  have 
allowed  given  the  large  number  of  call  types.  The  stationary  distribution  was  derived  from 
the  set  of  call  type  transition  probability  estimates.  Rather  than  conditioning  on  the  first  call 
type  in  the  actual  sequences,  a  set  of  1000  randomized  realizations  of  the  calling  sequence 
were  generated  using  the  stationary  distribution  to  determine  the  first  call  type  for  each 
sequence.  The  chi-square  statistic  was  re-computed  for  each  realization.  Brown  (1974) 
formulated  a  formal  sequential  approach  that  operated  under  the  assumption  of 
independence  to  identify  sources  of  significance  in  two-way  contingency  tables.  Because  the 
data  here  were  not  independent,  I  used  this  test  informally  to  determine  the  3  call  types  that 
contributed  primarily  to  a  significant  chi-square  value.  The  procedure  involved  locating  the 
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cell  in  the  contingency  table  with  the  largest  deviation  between  the  observed  value  and  the 
expected  value  given  by: 


"V  - 


(>;  -  a){Cj  -  a,)/(N  - 1)  -  Cj  +  a) 


(4.1) 


where  /'  is  the  /h  row  sum,  c,  is  the y*  column  sum  and  a,:  is  the  observed  value  for  cell  (i,j). 
After  replacing  the  entry  of  this  cell  with  E-,  the  process  was  repeated.  Three  such 

iterations  yielded  the  three  call  types  that  contributed  principally  to  a  significant  chi-square 
score  (Brown,  1974). 


4.4.2  Results  and  discussion 

There  were  some  general  patterns  that  related  the  vocal  activity  and  movement 
behavior  of  the  animals.  Focusing  initially  on  NTS  periods,  movement  measurements 
during  bouts  of  calling  activity  were  compared  to  preceding  bouts  that  lacked  calling  activity. 
There  was  at  least  one  such  transition  during  a  NTS  period  from  a  bout  without  calling 
activity  to  a  bout  with  calling  activity  in  the  recordings  of  7  animals  (i.e.,  oo05_316a, 
oo05_320a,  oo05_322a,  oo05_322b,  oo05_324a,  oo06__313s  and  oo06_314a).  Figure  4.1 
plots  the  timing  of  stereotyped  pulsed  calls  on  the  depth  records  for  all  of  the  whales  whose 
records  were  analyzed  for  this  chapter.  For  male  oo06_314a,  for  example,  the  period 
without  calling  spanned  0-71  minutes  (purple  horizontal  bar)  and  the  subsequent  bout  of 
calling  activity  occurred  between  71  and  79  minutes  (green  horizontal  bar).  The  remainder 
of  the  record  was  excluded  from  this  particular  analysis  because  the  next  phase  of  calling 
activity  was  paired  with  tail  slapping  (orange  horizontal  bar).  Of  the  movement  measures 
compared  before  and  during  the  calling  bouts  for  the  7  animals,  calling  activity  bouts 
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Figure  4.1.  Depth  records  of  whales  with  calling  activity  overlaid.  During  NTS  episodes,  the  purple  bars 
indicate  the  non-calling  periods  that  precede  and  pair  with  the  green  bars  that  overlie  the  bouts  of  calling 
activity'.  Sections  without  a  horizontal  bar  belong  to  NTS  periods  that  are  not  part  of  such  a  pairing.  Orange 
bars  span  TS  episodes.  For  records  characterized  entirely  by  NTS,  calls  are  plotted  in  yellow.  For  records 
characterized  by  both  NTS  and  TS  periods,  LFC  calls  are  plotted  in  black  and  L/HFC  calls  are  plotted  in 
maroon.  (It  was  only  for  these  records  that  this  distinction  was  relevant  for  the  analysis.)  This  coloration 
scheme  was  not  applied  to  oo06_314a  or  oo06_317s  because  their  records  contained  only  1  or  0  L/l  IFC  calls, 
respectively. 
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Figure  4.2.  Mean  depth  (left)  and  standard  deviation  of  the  vertical  velocity  (right)  plotted  for  bouts  without 
calling  activity  versus  successive  bouts  with  calling  activity  during  NTS  periods  for  7  whales.  The  1:1  line  is 
plotted  in  each  figure.  With  the  exception  of  a  single  mean  depth  value  for  oo05_322b,  all  measurements  were 
smaller  during  a  calling  bout  than  in  the  non-vocal  time  period  preceding  it. 


consistendy  occurred  at  shallower  depths  (except  on  one  occasion  by  a  single  whale)  and 
were  characterized  by  less  variable  vertical  velocities  (Figure  4.2).  It  is  reasonable  that 
animals  occupying  shallower  depths  during  calling  periods  would  traverse  less  vertical 
distance,  leading  to  smaller  variation  in  their  vertical  velocities.  This  pattern  is  also  evident 
in  Figure  4.1.  No  further  trends  were  evident  for  the  standard  deviation  of  the  change  in 
pointing  angle,  change  in  roll,  or  fluking  intensity  (data  now  shown). 
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(a)  oo05_322a  LI  fb;  oo05_320b  LI 


\Y  -  E 


Figure  4.3.  Horizontal  tracks  of  two  tagged  killer  whales  with  the  linearity  index  (LI)  plotted  in  color.  Thin 
sections  of  the  track  correspond  to  an  absence  of  calling  behavior.  Each  call  has  been  plotted  by  thickening 
the  track.  In  both  plots,  the  dotted  square  boxes  indicate  the  portions  of  the  track  associated  with  tail  slapping 
and  concomitant  vocal  activity'.  In  panel  (a),  this  carousel  feeding  period  is  preceded  by  putative  corralling 
(dotted  circle)  that  contains  high  levels  of  calling.  In  panel  (b),  however,  the  preceding  period  of  directional 
travel  was  quiet  vocally. 

Once  these  different  movement  and  vocal  behaviors  during  NTS  periods  were 
identified,  their  relationship  with  trends  during  the  TS  episodes  were  explored.  Tail  slapping 
activity  was  consistently  matched  with  heightened  vocal  activity.  In  Chapter  2,  two  primary 
behavioral  sequences  were  found  to  anticipate  carousel  feeding:  one  that  involved  a 
preceding  period  ot  putative  corralling  and  another  that  involved  putative  travel.  In  all  4  of 
the  sequences  in  which  putative  corralling  anticipated  tail  slapping,  calling  activity  was 
observed  during  both  of  these  behaviors  (Figure  4.3a).  In  2  of  the  3  instances  in  which 
directional  travel  (but  no  putative  corralling)  preceded  the  tail  slapping,  calling  activity  was 
absent  during  travel  but  began  suddenly  once  the  tail  slapping  commenced  (Figure  4.3b). 
dEie  animals  were  vocally  active  during  feeding  regardless  of  the  prior  behavior,  suggesting 
that  they  were  communicating  with  one  another.  For  the  four  animals  whose  feeding  was 
preceded  by  putative  corralling,  their  vocal  activity  similarly  suggested  some  kind  of  acoustic 
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communication  or  elevated  arousal  level.  The  recording  from  female  oo05_322a  contained 
substantial  vocal  activity  during  the  earlier  directional  travel  periods  as  well  (Figure  4.3a). 

The  absence  of  vocal  activity  in  the  2  cases  of  travel  indicates  that  they  did  not  need  to 
communicate  acoustically  during  this  time  and/or  they  were  relying  on  passive  listening  to 
locate  distant  groups  of  animals  that  were  feeding  noisily.  In  the  case  depicted  in  Figure 
4.3b,  visual  observations  confirmed  that  the  tagged  animal  and  his  group  did  approach 
another  group  of  killer  whales  that  was  already  engaged  in  carousel  feeding. 

The  vocal  behavior  was  then  considered  more  closely  by  separating  it  into  the 
L/HFC  calls  that  feature  additional  orientation  cues  (Miller,  2002)  and  the  less  directional 
LFC  calls.  As  described  above,  I  predicted  that  the  group  carousel  feeding  activity 
associated  with  TS  periods  might  have  benefited  from  orientation  signaling  and  therefore 
may  have  been  characterized  by  more  L/FIFC  than  LFC  calls.  The  results  from  this  analysis 
are  presented  in  Table  4.3.  L/HFC  calls  occurred  more  during  TS  periods  and  LFC  calls 
more  during  NTS  periods  than  expected  by  chance  for  5  of  the  7  whales  that  were  tested 
(one  of  these  whales,  oo05_322b,  showed  borderline  significance).  The  pooled  analysis  in 
which  the  results  of  6  whales  were  considered  at  once  was  highly  significant  (P  <  0.001). 

This  finding  supports  the  notion  that  Norwegian  killer  whales  may  use  the  L/HFC 
calls  to  provide  an  orientation  cue  to  group  members  during  carousel  feeding.  Table  4.3 
reveals  that  the  proportion  of  LFC  calls  during  NTS  periods  was  consistently  higher  than 
during  TS  periods.  For  the  whales  displaying  significant  results,  the  proportion  of  L/HFC 
calls  during  TS  periods  was  always  larger  than  the  proportion  during  NTS  periods  even  if  the 
actual  counts  of  L/HFC  calls  were  almost  always  smaller  than  LFC  calls  in  both  TS  and 
NTS  periods.  Individuals  oo05_322a  and  oo05_322b  were  tagged  simultaneously.  Results 
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whale  id 

N 

L/HFC:  TS 

L/HFC:  NTS 

odds  ratio 

P-value 

oo05_320b 

663 

56.0 

18.9 

5.39 

0.007 

oo05_321a 

162 

46.2 

1.5 

39.34 

0.002 

oo05_322a 

1728 

31.4 

13.0 

3.04 

0.013 

oo05_322b 

1513 

25.4 

10.1 

3.00 

0.109 

oo()6_313s 

279 

46.8 

29.0 

2.14 

0.215 

oo06_314s 

766 

21.9 

3.4 

7.91 

0.012 

oo06_327s 

540 

15.7 

29.2 

0.46 

0.838 

Table  4.3.  Results  of  the  rotation  test  to  assess  pairing  of  L/HFC  or  LFC  calls  during  TS  versus  NTS  periods. 
The  first  and  second  columns  provide  the  identification  of  the  whale  and  the  total  number  of  calls  used  in  the 
test.  The  next  two  columns  list  the  percentage  of  L/HFC  call  production  during  TS  and  NTS  periods, 
respectively.  The  odds  ratio  and  the  significance  of  the  rotation  test  are  offered  in  the  last  two  columns. 


whale  id 

P-value 

call  types  acting  as  sources  of  significance 

oo05_320b 

<  0.001 

N12.  /,  N64,  N45 

oo05_321a 

<  0.001 

N8,  N16.LN85 

oo05_322a 

<  0.001 

N85.  N8.  N65 

oo05_322b 

<  0.001 

N10.  N8U  N85 

oo06_313s 

0.074 

N89.  N77.3,  N82 

oo06_314s 

<  0.001 

N73y  N85,  N72.2 

oo06_327s 

0.002 

(N15),  (N84),  N72.3 

Table  4.4.  Results  of  Markov  chain  test  to  compare  call  type  frequency  distributions  during  NTS  versus  TS 
periods.  A  significant  P-value  indicates  that,  once  the  senal  dependence  was  considered  by  modeling  the 
sequence  using  a  Markov  chain,  the  counts  of  call  types  differed  between  NTS  and  TS  episodes.  Call  ty  pes  are 
italicized  if  they  were  lowered  during  NTS  or  elevated  during  TS  periods  and  are  in  bold  if  there  were  elevated 
during  NTS  or  lowered  during  TS  periods  (the  predicted  outcomes).  L/HFC  calls  are  underlined  and  calls  that 
do  not  fit  into  either  LFC  or  L/HFC  are  listed  parenthetically.  Call  type  N15  was  characterized  by  a  short, 
fairly  flat  contour  but  spanned  a  wide  spectral  range  and  N84  contained  a  HFC  only. 


for  both  of  these  animals  are  reported  in  Table  4.3  though  they  are  not  independent.  The 
especially  large  P-value  for  oo06_327s  may  have  resulted  from  the  conservative  division  of 
NTS  and  TS  periods  used  here.  The  tag  record  of  this  whale  contained  either  putative 
corralling  (considered  NTS)  or  tail  slapping  episodes.  Both  of  these  behaviors  may  have 
benefited  from  a  higher  incidence  of  L/HFC  calls,  causing  the  lack  of  significance. 

The  next  analysis  explored  whether  different  call  types  were  used  in  the  two  different 
NTS  and  TS  behavioral  states.  When  a  Markov  chain  was  used  to  model  the  senal 
dependence  of  the  call  type  sequence,  all  of  the  7  whales  demonstrated  significant  results 
(Table  4.4;  one  animal,  oo06_313s,  showed  borderline  significance).  This  indicated  that  the 
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Figure  4.4.  These  plots  present  the  percentage  contribution  of  each  call  type  to  the  calls  produced  during 
NTS  (black  bars)  and  TS  (white  bars)  periods. 
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distribution  of  call  type  rates  was  significantly  different  between  the  NTS  and  TS  periods. 
Figure  4.4  plots  the  percent  contribution  of  each  call  type  during  NTS  and  TS  periods  for  7 
whales  displaying  both  behavioral  states.  Then,  the  call  types  contributing  to  this  significant 
difference  were  studied  more  closely  (see  last  two  columns  of  Table  4.4).  In  six  whales,  all 
three  of  these  call  types  were  consistent  with  the  pattern  observed  above:  LFC  call  ty  pes 
were  more  abundant  during  NTS  periods  (or  less  abundant  during  TS  periods)  and  L/HFC 
call  types  were  more  abundant  during  TS  periods  (or  less  abundant  during  NTS  periods). 
For  the  remaining  whale  (i.e.,  oo06_327s),  one  of  the  top  three  call  types  followed  this 
pattern,  one  covered  a  wide  spectral  range  (N15)  and  the  third  contained  a  I IFC  only  (N84). 
These  results  were  consistent  with  the  orientation  cueing  prediction  and  support  the 
possibility  that  the  killer  whales  produced  subsets  of  call  types  at  different  rates  depending 
on  the  behavioral  context. 

4.5  Analysis  2:  Incorporating  the  movement  sensors 

Sometimes  the  onset  or  ending  of  calling  activity-  corresponded  rather  closely  to  the 
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Figure  4.5.  Call  type  sequences  for  oo05_316a  (top)  and  oo05_322b  (bottom).  The  legends  to  the  right 
provide  the  call  type  labels  and  the  letters  above  the  square  plots  indicate  the  NTS  (N)  and  TS  (T)  documents. 
Notice  that  sometimes  call  type  onset  and  offset  occur  at  the  document  boundaries  (e.g.,  N8,  bottom  plot) 
whereas  other  times  they  do  not  (e.g.,  N10.3,  top  plot). 
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boundaries  of  the  NTS  and  TS  periods  while  other  times  there  appeared  to  be  little  obvious 
connection  (Figure  4.5).  Individual  oo05_316a,  for  example,  only  displayed  NTS  behavior. 
Its  calling  activity  showed  distinct  patterning  with  most  call  types  beginning  around  95 
minutes  (Figure  4.5,  top).  Other  call  types  occurred  earlier  and  call  type  N45  was  produced 
throughout  the  recording.  Certain  call  types  (e.g.,  N8,  N65,  N71,  N86)  clearly  clustered  in 
the  primary  TS  section  of  the  oo05_322b  recording  (Figure  4.5,  bottom).  Others  (e.g.,  N68, 
N72.3,  N73,  N77,  N78)  spanned  from  the  first  NTS  section  (containing  the  putative 
corralling)  through  the  primary  TS  section.  A  few  of  the  call  types  (e.g.,  N23.2,  N72,  N80, 
N81)  continued  to  be  produced  in  the  final  NTS  section. 

Although  it  is  possible  that  the  calls  types  may  have  been  produced  interchangeably, 
the  calling  sequence  patterns  described  in  Section  4.4  and  Figure  4.4  suggest  otherwise. 
Previous  approaches  have  studied  the  functions  of  killer  whale  call  types  by  testing  in  which 
behavioral  states  they  occur.  These  states  have  been  assumed  to  be  wholly  representative  of 
what  an  animal  can  do.  This  traditional  approach  of  using  a  state-dependent  model  (i.e., 
partitioning  the  record  into  NTS  and  TS  periods)  may  not  be  powerful  enough  to  capture 
the  rich  dynamics  of  vocal  behavior.  I  therefore  explored  what  could  be  gained  by  taking 
advantage  of  the  densely  sampled  sensor  data  and  relating  it  to  each  call  type.  In  the  first 
analysis,  I  considered  the  question  of  whether  different  call  types  triggered  particular 
movement  responses  in  the  tagged  animal.  In  the  second  analysis,  this  perspective  was 
reversed  by  investigating  whether  certain  movement  regimes  were  characterized  by  and 
could  predict  different  sets  of  call  types. 
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4.5.1  Methods 


In  this  section,  a  vocal  sequence  was  considered  to  be  a  marked  point  process,  or  a 
dme  series  of  calling  events  that  was  each  defined  according  to  a  call  type.  Five  continuous 
movement  features  were  incorporated  as  multivariate  data  streams:  depth,  pointing  angle, 
roll,  verdcal  velocity  and  fluking  intensity  (as  defined  above).  The  poindng  angle  and  roll 
quanddes  were  measured  in  radians.  A  running  mean  and  standard  deviadon  were 
computed  over  these  movement  data  using  a  window  length  of  30s.  Each  vocalizadon  could 
be  associated  with  the  coincident  values  of  these  measurements. 

A  muldvariate  analysis  of  variance  (MAN OVA)  was  conducted  first,  which  tested 
whether  the  mean  vectors  of  the  movement  data  associated  with  each  call  tvpe  were 
significandy  different.  The  running  means  and  standard  deviations  of  the  movement  data  as 
well  as  the  differences  in  the  mean  values  before  and  after  each  call  type  were  considered. 
This  approach  relied  on  computing  the  Wilks  lambda  criterion: 


A  = 


|  E  | 

H  +  E  | 


(4.2) 


where  E  and  H  are  the  error  and  hypothesis  sum-of-squares  and  cross-products  (SSCP) 

matrices,  respectively  (see  Huberty  &  Olejnik,  2006  for  further  elaboration,  derivation  and 

discussion).  The  F  test  statistic  was  then  calculated  as: 

p  _  1-A  dff-/>  +  l 
A  p 

where  p  was  the  number  of  outcome  variables  (i.e.,  5)  and  df  denoted  the  error  degrees  of 
freedom,  which  was  computed  as  the  difference  between  the  number  of  data  points  and  p. 
A  rotation  test  was  conducted  to  assess  significance  while  preserving  the  serial  dependence 


(4.3) 


123 


of  the  calling  data.  The  timing  of  the  calls  was  rotated  relative  to  the  movement  data  and  the 
mean  vectors  were  recomputed.  This  rotation  procedure  was  performed  100  times  and  the 
observed  value  of  the  F  statistic  was  compared  to  the  distribution  of  values  produced  by 
rotation  to  determine  a  P-value. 

The  second  analysis  conducted  here  involved  a  multinomial  logistic  regression  in 
which  the  dependence  of  the  distribution  of  call  type  on  the  multivariate  movement 
measurements  was  modeled.  The  regression  was  of  the  form: 


P(Y  =  s \x)  =  -f 


(4-4) 


where  g ■  (x)  =  was  a  linear  predictor  involves  a  constant  term  (since  x0  =  1 ) 


and  5  covariates  that  corresponded  to  each  of  the  movement  measurements  (Hosmer  & 
Lemeshow,  2000).  The  left-hand  side  of  equation  (4.4)  denotes  the  probability  that  a  calling 
event,  Y,  was  of  a  particular  type,  s,  given  a  particular  combination  of  movement  features,  x. 
There  were  a  total  of  n  call  type  categories.  This  multinomial  logistic  regression  model  was 
fit  to  the  dataset  through  maximum  log  likelihood  by  maximizing  the  likelihood  function: 


(4.5) 


To  preserve  the  serial  dependence  of  the  calling  behavior  when  assessing  significance,  the 


call  type  sequence  was  again  rotated  relative  to  the  movement  data  for  each  whale  and  the 


same  maximum  log  likelihood  value  jL  was  computed. 
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whale  id 

a 

dn 

oo05_316a 

0.06 

0.16 

0.01 

oo05_320a 

0.18 

0.03 

0.66 

oo05_320b 

0.54 

0.27 

1.00 

oo05_321a 

0.56 

0.57 

0.61 

oo05_322a 

0.16 

0.14 

0.48 

oo05_322b 

0.10 

0.21 

0.09 

oo05_324a 

0.21 

0.84 

0.40 

oo06_313s 

0.11 

0.54 

0.05 

oo06_3 1 4a 

0.07 

0.04 

0.12 

oo06_314s 

<0.01 

<0.01 

<0.01 

oo06_317s 

0.37 

0.10 

0.27 

oo06_327s 

0.26 

0.76 

0.10 

Table  4.5.  Results  of  MANOVA  test  for  the  12  whales  and  3  sets  of  movement  measures:  the  running 
standard  deviation  (a),  running  mean  (jjl)  and  difference  in  means  (d(ji).  Significant  P-values  are  marked  in  bold. 


4.5.2  Results  and  discussion 

The  results  of  the  MANOVA  test  are  presented  in  Table  4.5.  The  call  types  of  only 
one  recording,  oo06_314s,  were  significandy  associated  with  different  mean  vectors  of  the 
running  standard  deviadon  of  the  movement  features.  The  results  were  significant  for  3  of 
the  12  recordings  for  both  the  running  mean  and  the  change  in  mean  value.  Thus,  of  36 
whale-movement  stream  combinations,  only  7  were  significant  (19%  of  the  total.  Table  4.5). 
This  small  quandty  of  significant  results  supported  the  claim  that  the  call  tvpes  were 
interchangeable.  This  conclusion  was  reinforced  by  the  lack  of  significance  of  the 
multinomial  logistic  regression,  the  second  analysis  described  above,  for  all  whales  and 
movement  streams.  In  addition,  no  significant  findings  emerged  when  the  data  were 
inspected  with  multinomial  regressions  using  time  as  the  regressor  instead  of  movement. 
Multidimensional  scaling  and  principal  components  analysis,  techniques  that  reduced  the 
dimensionality  of  the  dataset  for  more  straightforward  handling  and  visualization  purposes, 
did  not  yield  any  evident  patterns  either.  Therefore,  no  convincing  evidence  was  found  to 
support  the  hypothesis  that  different  call  types  produced  by  the  group  were  associated  with 
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particular  movement  signatures  of  the  tagged  individual  or  vice  versa. 


4.6  General  Discussion 

The  results  reflected  certain  connections  between  the  vocal  and  movement  data  at  a 
very  broad  level.  Across  7  animals  during  NTS  periods,  bouts  of  calls  were  produced  closer 
to  the  surface  and  during  spans  of  less  variable  verdcal  velocity  compared  with  silent  periods 
occurring  immediately  prior  (Figures  4.1  &  4.2).  This  behavior  resembled  sperm  whale 
codas  that  tend  to  be  produced  near  the  surface  for  what  is  likely  a  social  communicadon 
purpose.  TS  episodes  were  dependably  associated  with  high  rates  of  calling  acdvity,  as  were 
4  out  of  4  putadve  corralling  episodes  preceding  instances  of  carousel  feeding.  On  2  of  3 
occasions  when  putadve  travel  andcipated  carousel  feeding,  the  absence  of  calling  behavior 
during  the  travel  pordon  gave  way  to  intense  vocal  acdvity  during  the  carousel. 

These  putadve  travel  periods  were  generally  silent  and  occasionally  punctuated  by  a 
sharp  turn  resulting  in  a  path  that  converged  on  a  carousel  feeding  opportunity  (see  Chapter 
2;  Figure  4.3).  This  suggests  that  the  animals  may  have  remained  quiet  to  rely  on  passive 
listening  to  locate  other  carousel  feeding  groups  whose  calling  and  tail  slapping  would  have 
rendered  their  position  and  acdvity  evident,  a  scenario  confirmed  with  visual  observations 
on  at  least  one  occasion.  More  data  are  required  to  assess  the  likelihood  of  this  hypothesis 
since  one  of  the  traveling  periods  did  contain  vocal  acdvity.  This  could  provide  a  structured 
opportunity  to  investigate  whether  animals  choose  to  vocalize  based  on  the  composition  of 
the  tail  slapping  group,  which  they  could  likely  determine  by  listening  to  its  vocal  repertoire. 
Putative  corralling  and  carousel  feeding  were  reliably  associated  with  heightened  calling 
behavior  (Figure  4.3).  It  is  possible  that  these  activities  produced  an  elevated  arousal  level 
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that  released  more  vocal  behavior.  This  relationship  has  been  observed  in  other  populadons 
of  killer  whales  (e.g.,  transients,  Deecke  et  al.,  2005).  An  alternative  explanation  for  this 
observation  provides  that  calling  is  somehow  necessary  for  the  proper  execution  of  these 
foraging-related  behaviors. 

The  next  set  of  results  supported  this  second  conclusion  since  L/HFC  call  types, 
vocalizations  containing  both  the  low  and  high  frequency  components  necessary  to  provide 
the  orientation  cue  hypothesized  by  Miller  (2002),  were  more  abundant  during  TS  versus 
NTS  periods.  The  reverse  was  true  for  LFC  call  types,  which  were  more  common  during 
NTS  episodes.  This  conclusion  was  consistent  both  when  calls  were  divided  broadly  into 
the  two  classes  of  LFC  and  L/HFC  types  (Table  4.3)  and  when  the  rates  of  individual  call 
types  were  considered  separately  (Table  4.4;  Figure  4.4).  Corralling  and  carouseling  involved 
multiple  animals  gathering,  positioning  and  condensing  a  ball  of  herring  to  allow  successful 
tail  slapping  and  feeding.  Killer  whales  involved  in  this  incessant  maneuvering  would  benefit 
from  knowing  the  location,  orientation  and  direction  of  movement  of  other  group  members. 
The  higher  incidence  of  L/HFC  call  types  during  TS  episodes  was  consistent  with  the 
prediction  that  the  animals  could  have  relied  on  the  orientation  cues  embedded  within  these 
calls  to  manage  the  ball  of  herring  more  effectively.  Because  L/HFC  calls  tend  to  be  louder 
(Miller,  2006),  it  is  also  possible  that  this  change  in  usage  was  due  to  louder  call  production 
intended  to  attract  other  groups  or  resulting  from  the  excitement  of  the  animals. 

This  first  set  of  results  suggested  differential  call  type  production  according  to 
behavioral  state.  Bouts  of  calling  behavior  often  commenced  and  terminated  rather 
suddenly  (Figure  4.5).  There  were  likely  transitions  occurring  between  behavioral  states 
more  refined  than  the  coarse  TS/NTS  distinction  made  earlier.  The  detailed  movement  data 
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streams  were  considered  more  closely  to  explore  whether  they  could  serve  as  reliable 
predictors  of  call  types  or  vice  versa.  Only  7  of  the  36  whale-movement  stream 
combinations  produced  a  significant  difference  between  the  means  of  the  movement  vectors 
when  they  were  separated  according  to  call  type  (Table  4.5).  No  significant  results  arose 
when  call  type  was  regressed  onto  the  multivariate  movement  features  or  when  several  other 
techniques  were  attempted.  The  second  set  of  analyses  did  not  support  the  nodon  that  call 
types  could  be  associated  reliably  with  disdncdve  sets  of  movement  features.  I  find  it  likely 
that  either  the  detailed  movement  data  were  not  effectively  or  wholly  characterizing  relevant 
behaviors  or  the  calls  had  been  sorted  into  call  type  categories  that  may  not  have  mattered  to 
the  animals. 

Indeed,  the  findings  presented  in  this  chapter  suggest  that  Norwegian  killer  whale 
call  types  are  not  entirely  interchangeable  as  they  may  function  to  inform  conspecifics  about 
the  3D  orientation  and  position  of  the  signaler.  Carousel  feeding  in  particular  requires 
multiple  animals  to  corral  and  maintain  the  spatial  integrity  of  a  ball  of  herring  for  successful 
foraging  to  occur.  During  this  dynamic  enterprise,  killer  whales  would  certainly  profit  from 
orientation  cues.  In  addition,  Ford  (1989)  proposed  that  the  group-specific  vocal  repertoires 
of  killer  whales  may  function  as  a  badge  for  group  identity  but  did  not  explain  why  more 
than  a  single  call  type  was  necessary  to  achieve  this  function.  It  is  possible  that  large 
repertoires  allow  animals  to  distinguish  association  patterns  and  levels  of  interaction  more 
finely.  For  example,  killer  whales  that  share  50%  of  their  repertoire  are  likely  to  encounter 
one  another  more  often  than  those  sharing  25%.  The  extent  of  repertoire  overlap  presented 
in  Table  4.1  could  therefore  be  used  to  formulate  a  hierarchy  of  degrees  of  social  association 
between  the  recorded  groups.  This  overlap  could  result  from  vocal  convergence  (see  Mitani 
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&  Gros-Louis,  1998;  Smolker  &  Pepper,  1999;  Hile  et  al.,  2000)  or  repertoire  matching 
(Beecher  et  al.,  1996;  see  Introduction). 

One  element  missing  from  the  findings  presented  here  concerns  the  behavioral 
consistency  of  individual  whales  over  dme  courses  that  extend  beyond  a  few  hours.  Data 
collection  efforts  in  the  future  should  therefore  focus  on  gathering  recordings  of  the  same 
animals  across  muldple  carousels  that  ideally  include  muldple  instances  of  both  the  traveling 
and  corralling  sequences  andcipaung  the  feeding.  Addidonal  data  are  also  necessarv  to  verify 
the  absence  of  a  reladonship  between  the  fine  movement  and  vocal  data  analyzed  here  in 
Secdon  4.5.  If  the  same  result  conunues  to  emerge,  then  either  the  killer  whale  pulsed  calls 
really  are  context-independent  or  we  lack  sufficient  informadon  about  other  social  or 
environmental  cues  that  may  be  triggering  certain  bouts  of  calling  activity.  Playback 
experiments  of  carousel  feeding  calling  sequences  and  tail  slaps  to  groups  of  traveling  killer 
whales  could  test  whether  these  animals  do  eavesdrop  on  and  exploit  conspecifics  that  have 
already  corralled  and  brought  a  ball  of  herring  to  the  surface.  Addidonal  research  effort  is 
likely  to  be  rewarded  with  the  discovery  of  more  links  between  the  vocal  and  non-vocal 
behavior  of  free-ranging  killer  whales,  testing  explicidy  whether  their  call  types  are 
interchangeable  and  functionally  equivalent. 
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Chapter  5.  Phonemic  Segment  Characterization  of  Norwegian 
Killer 

Whale  Call  Types 

5.1  Abstract 

Killer  whale  vocal  production  has  traditionally  been  categorized  by  human  observers 
into  a  set  of  discrete  call  types.  These  call  types  often  contain  internal  spectral  shifts,  silent 
gaps  and  synchronously  produced  low  and  high  frequency  components.  The  fundamental 
units  used  to  build  signals  into  a  repertoire  and  the  syntactic  rules  associated  with  that 
assembly  are  essential  to  understand  animal  vocal  communication  systems.  Inspired  bv 
human  speech  processing  techniques  and  algorithms,  this  chapter  tested  whether  call  types 
could  be  represented  by  a  set  of  flexibly  arranged  and  smaller  phonemic  segments.  In 
particular,  I  evaluated  whether  segmented  characterizations  of  stereotyped  Norwegian  killer 
whale  calls  yielded  automated  classification  results  of  contour  traces  that  paralleled  a 
classification  scheme  using  whole  call  type  designations.  Representations  of  calls  in  their 
entirety  or  as  sets  of  either  distinct  or  shared  syllables  did  achieve  similar  performance.  Calls 
composed  of  shared  segments  may  provide  a  more  parsimonious  approach  to  parsing  the 
vocal  stream  since  there  were  fewer  segments  than  call  types,  nearly  75%  of  all  call  tvpes 
contained  at  least  one  shared  syllable,  and  some  syntactic  patterns  were  evident.  Such  a 
system  could  flexibly  generate  the  killer  whale  vocal  repertoire  as  a  subset  of  all  possible 
combinations  of  segments. 

5.2  Introduction 

An  important  question  in  animal  behavior  and  communication  concerns  how  the 
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brain  archives  and  generates  a  sequence  of  acousdc  signals,  which  has  implications  for 
individual  and  interacdve  vocal  performance  (see  Lee  et  al.,  2004).  Chunking,  the  process  by 
which  serial  informadon  is  segmented  during  memory  formadon  to  facilitate  subsequent 
recall  (Simon,  1974;  Terrace,  1987),  can  help  explain  certain  perceptual  and  producdon 
features  of  sequences  generated  from  a  vocal  repertoire.  More  elaborate  repertoires  can  arise 
when  a  smaller  inventory  of  chunked  signals  are  reshuffled  to  generate  a  vast  array  of  new 
vocal  combinadons  (e.g.,  Dobson  &  Lemon,  1979).  Zebra  finches  (Taeniopygia  guttata),  for 
example,  copy  serial  strings  of  sung  syllables  and  intervening  periods  of  silence  from  a 
variety  of  adult  tutors  and  then  reorder  these  chunks  to  produce  their  own  song  (Williams  & 
Staples,  1992).  Similarly,  nightingales  (Luscinia  megarhynchos)  acquire  song  by  extracting  and 
rearranging  coherent  packages  of  3-5  song  types  from  longer  model  sequences  (Hultsch  & 
Todt,  1989).  Individual  birds  of  both  species  tended  to  segment  their  vocal  sequences  at 
different  breakpoints  and  likely  exploit  the  chunking  of  songs  to  memorize  a  larger  array  of 
vocal  components.  Chunking  helps  to  organize  vocal  informadon  by  breaking  received 
sequences  into  smaller  elements,  which  can  then  be  used  to  build  and  produce  identical  or 
novel  sequences. 

Syntax  may  be  broadly  defined  as  the  set  of  rules  that  inform  how  smaller 
communication  tokens  are  organized  into  larger  phrases  (Snowdon,  1982;  Hailman  & 

Ficken,  1986).  Likely  candidates  for  these  tokens  include  discrete,  intact  segments  that  are 
found  within  different  sequences.  Syntax  constrains  the  ordering  of  such  subunits  within  the 
set  of  all  possible  combinations  and  sequences  and  its  analysis  can  help  identify  the  chunks 
that  compose  a  repertoire.  Certain  features  help  define  the  boundaries  of  these  chunks.  In 
birdsong,  for  example,  syllables  are  generally  defined  as  continuous  and  discrete  elements 
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separated  by  silence  (Lemon  &  Chatfield,  1971;  Eales,  1985;  Eens  et  al.,  1989).  Indeed, 
zebra  finches  whose  songs  were  experimentally  interrupted  usually  stopped  singing  between 
song  syllables,  suggesting  that  the  syllable  was  a  meaningful  unit  at  least  in  terms  of  basic 
production  (Cynx,  1990).  In  human  speech,  positions  of  maximum  spectral  transition  are 
important  for  consonant  and  vowel  perception  (Furui,  1986).  Similarly,  Yurk  (2005)  used 
abrupt  and  discontinuous  spectral  shifts  to  define  boundaries  between  syllables  in  killer 
whale  ( Orcinus  orcci)  vocalizations. 

Marler  (1977)  distinguished  phonological  syntax,  in  which  elements  from  the 
repertoire  are  rearranged  in  specific  ways  to  generate  new  vocalizations  (the  focus  of  this 
chapter),  from  lexical  syntax,  in  which  the  meaning  of  a  newly  combined  vocalization  is 
derived  from  both  the  order  and  the  constituent  meanings  of  its  components.  Birdsong 
lasting  on  the  order  of  seconds  tends  to  be  built  from  individual  notes  whose  species- 
specific  phonological  arrangement  is  required  to  evoke  an  appropriate  behavioral  response 
(e.g.,  European  starlings  ( Stumus  vulgaris ):  Bohner  &  Todt,  1996;  white-crowned  sparrows 
( Zonotrichia  leucophrys)'.  Soha  &  Marler,  2001).  Humpback  whale  ( Megaptera  novaeangliae)  song 
lasts  on  the  order  of  minutes  and  is  constructed  phonologically  from  themes  that  consist  of 
repeated  phrases  (Payne  &  McVay,  1971).  These  songs  change  over  time  as  themes  are 
added  and  lost  and  phrases  are  modified  (Payne  et  al.,  1984;  Noad  et  al.,  2000).  Bird  and 
humpback  whale  song  repertoires  are  both  constituted  from  a  set  of  flexibly  arranged, 
smaller  and  more  basic  units  of  vocal  production.  The  temporal  and  spectral  properties  of 
these  tokens  differ  but  their  role  as  the  building  blocks  of  a  syntactically-organized  repertoire 
is  the  same. 

Vocal  production  characterized  by  phonological  syntax  has  been  observed  in  social 


133 


species  with  a  fixed  repertoire  of  sounds.  Sequences  of  discrete  calls  are  well  suited  for 
syntacdc  analysis  since  the  onset  and  ending  of  vocalizations  are  usually  evident  and 
stereotypy  can  facilitate  categorization  of  individual  calls  (Slater,  1973;  1983).  Killer  whales 
produce  such  a  set  of  stereotyped  pulsed  calls  (see  Ford,  1987;  Strager,  1993;  Filatova  et  al., 
2004)  and  they  live  in  stable  family  groups,  interact  vocally  in  a  variety  of  contexts  (Ford, 
1989;  Deecke  et  al.,  2005;  Van  Opzeeland  et  al.,  2005;  Simon  et  al.,  2007)  and  show  evidence 
for  acoustic  mimicry  (see  Bowles  et  al.,  1988;  Foote  et  al.,  2006).  Call  types  have  been 
classified  similarly  by  human  observers  sorting  calls  aurally  and  spectrographically  and  by 
neural  networks  relying  on  temporal  and  spectral  features  of  a  trace  of  the  fundamental 
frequency  (Deecke  et  al.,  1999;  Deecke  &  Janik,  2006).  A  portion  of  the  repertoire  has  been 
labeled  variable,  a  miscellaneous  class  of  vocal  behavior  containing  the  calls  that  have  not 
sorted  neatly  into  one  of  the  stereotyped  categories. 

Here,  I  take  advantage  of  techniques  that  have  been  developed  in  the  field  of  human 
speech  recognition  to  explore  phonological  syntax  in  Norwegian  killer  whales.  Two  of  the 
hallmarks  of  human  language  include  an  ability  to  generate  a  vast  array  of  words  from  a  set 
of  a  few  dozen  phonemic  units  and  to  use  recursive  or  hierarchical  procedures  to  form 
appropriate  combinations  of  these  signals  (Nowak  et  al.,  2000;  Hauser  et  al.,  2002).  Early 
attempts  at  using  words  to  drive  speech  recognizers  gave  way  to  phonemic  representations 
that  improved  performance  on  large  vocabularies  considerably  (see  Lee  et  al.,  1989). 
Traditionally,  killer  whale  call  types  have  been  regarded  as  the  fundamental  units  of  vocal 
production.  Fashioned  after  the  approaches  developed  in  the  arena  of  human  speech 
research,  orca  vocalizations  were  decomposed  in  this  study  to  test  whether  a  simpler  set  of 
phonemic  segments  may  define  chunks  that  can  be  assembled  to  form  the  repertoire.  Unlike 
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human  speech  processing  tasks  where  we  know  when  the  computer  has  conducted  an 
accurate  and  efficient  parsing,  I  assess  the  traditional  and  phonemic  models  of  killer  whale 
vocalizations  by  comparing  their  call  type  classification  efficiency.  In  this  chapter,  the  terms 
segment  and  syllable  are  used  interchangeably. 

A  few  additional  features  of  killer  whale  calls  motivate  their  decomposition  into 
smaller  segments  to  explore  the  possibility  of  phonological  svntax.  First,  the  pulsed  calls  can 
contain  synchronously  produced  low  and  high  frequency  components  (LFCs  and  HFCs, 
respectively,  Hoelzel  &  Osborne,  1986;  Miller  &  Bain,  2000)  that  are  presumably  generated 
by  two  independent  sources  located  in  close  proximity7  to  one  another  (see  Miller  et  al.,  2007 
for  a  discussion).  These  components  are  spectrographically  distinct  entities:  the  LFC  is 
characterized  by  a  fundamental  frequency  between  80  -  2400  Hz  (Ford,  1987)  whereas  the 
HFC  extends  between  2  and  12  kHz  (Hoelzel  &  Osborne,  1986).  The  proposed 
independent  control  required  to  produce  these  two  components  would  allow  the  animals  to 
increase  their  repertoire  dramatically  simply  by  varying  the  LFC  and  HFC  pairings. 

Second,  Norwegian  killer  whales  produce  compound  calls,  concatenations  of 
multiple  discrete  calls  that  are  produced  elsewhere  in  the  record  either  individually  or  within 
other  compound  calls  (Strager,  1993;  1995).  This  combinatorial  and  syntactic  rearrangement 
of  entire  call  types  to  generate  compound  vocal  patterns  suggests  that  these  animals  may 
take  advantage  of  a  similar  mechanism  to  fashion  the  call  types  themselves  from  a  set  of 
even  smaller  subunits.  The  call  type  could  even  be  considered  a  compound  call  itself  if  its 
subunits  were  also  produced  elsewhere  on  their  own.  The  approach  presented  here  offers  a 
simple  framework  for  understanding  killer  whale  vocal  behavior  bv  arranging  production 
into  segments  that  are  flexibly  combined  to  form  call  types,  which  can  be  variably  ordered 
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themselves  to  generate  the  compound  calling  sequences.  Third,  subtype  designations  have 
been  used  to  distinguish  “structurally  unique  variants”  of  a  call  type  based  on  1)  consistent 
temporal  or  spectral  differences  of  particular  features  or  2)  the  presence  or  absence  of 
certain  segments  (Ford,  1991;  Strager,  1995).  These  segments  may  serve  as  the  phonological 
tokens  that  are  syntactically  rearranged  to  generate  call  types.  Indeed,  Yurk  (2005)  extracted 
syllables  from  the  calls  of  resident  and  transient  killer  whales  living  in  British  Columbia, 
categorized  them  by  eye  (using  their  “gestalt”  to  aid  the  differentiation)  and  found  that 
human  classifiers  agreed  with  these  divisions.  Different  arrangements  of  these  phonemes 
formed  the  call  types  from  these  animals. 

Figure  5.1  presents  a  set  of  motivating  spectrograms  for  this  work.  Each  column 
contains  a  series  of  calls  that  are  successively  built  upon  one  another.  In  the  first  column, 
the  first  call  is  N72,  a  gradual  frequency  modulated  upsweep.  The  next  two  calls  in  the 
vertical  series,  N72.2  and  N72.3,  contain  the  same  upsweep  followed  by  one  or  two  very 
short  higher  frequency  segments,  respectively.  These  segments  are  separated  by  brief  (< 
0.1s)  spans  of  silence.  The  foundation  of  the  call  series  in  the  second  column  of  Figure  5.1 
is  N  16.1,  a  continuous  call  type  characterized  by  both  a  LFC  and  HFC.  Each  successive 
LFC  in  the  vertical  series  accumulates  an  additional  constant  frequency  or  slightly  frequency 
modulated  segment.  The  HFC  segments  are  added  gradually  with  N16.1  and  N81 
containing  one  segment  and  N91.2  and  N91.3  containing  two  segments.  In  addition  to 
sharing  structural  similarity,  call  types  N16.1  and  N91  occasionally  occupied  the  same  final 
position  in  the  compound  call  series  beginning  with  the  calling  sequence  N66  — »  N67.  The 
examples  presented  in  Figure  5.1  are  certainly  consistent  with  the  notion  that  a  set  of 
syllables  can  be  flexibly  arranged  and  sequenced  to  generate  a  large  repertoire  of  call  types. 
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Figure  5.1.  hxamples  of  call  types  that  motivated  this  study.  The  three  spectrogram  panels  on  the  left  contain 
the  same  primary  call  base  labeled  N72.  N72.2  contains  a  short  additional  utterance  at  the  end  of  the  call  and 
N72.3  contains  two  such  additional  utterances.  The  four  spectrogram  panels  on  the  right  can  be  assembled 
from  top  to  bottom  by  sequentially  adding  both  low  and  high  frequency  components  to  the  NI6.1  base. 


Here,  leveraging  techniques  developed  for  human  speech  recognition,  I  confirm  the 
earlier  observation  of  shared  syllables  by  Yurk  (2005)  and  extend  it  further,  evaluating 
formally  whether  phonemic  divisions  of  Norwegian  killer  whale  stereotyped  calls  allow 
automated  classification  to  call  type  with  an  efficiency  that  parallels  a  classification  scheme 
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based  on  whole  call  type  designations.  This  chapter  presents  evidence  that  the  performance 
of  these  two  approaches  is  comparable.  The  characterization  of  pulsed  calls  with  a  set  of 
shared  syllables  may  be  superior  to  the  whole  call  type  approach,  however,  because  the 
phoneme  model  is  simpler,  requiring  less  information  and  computation  to  represent  the 
dataset.  I  also  test  the  possibility  that  certain  variable  calls  may  be  built  from  the  same 
phonological  segments  constituting  the  stereotyped  calls.  Finally,  inspired  by  the  small  yet 
universal  set  of  phonemes  that  are  sub-sampled  to  form  each  human  language  (see  Schultz  & 
Waibel,  2001;  Zhu  et  al.,  2005),  I  quantify  how  completely  the  phonemic  inventor}’  derived 
from  Norwegian  stereotyped  calls  characterizes  the  repertoire  of  resident  and  transient  killer 
whales  from  the  Pacific  Northwest. 

5.3  Materials  and  methods 
5.3.1  Data  collection 

Field  work  was  conducted  in  November  2005  and  2006  in  the  northern  Norwegian 
fjords  of  Tysfjord  and  Vestfjord  near  the  Lofoten  islands  (~68°15’  N,  M6°E).  Fourteen 
free-ranging  killer  whales  were  outfitted  with  digital  archival  tags  that  sampled  audio  at  96 
kHz  and  individual  movement  at  50  Hz,  storing  all  data  to  flash  memory  (Johnson  &  Tyack, 
2003).  The  animals  were  approached  in  a  rigid  hull  inflatable  boat  and  the  tags  were 
attached  using  a  7m  carbon  fiber  hand  pole.  A  VHF  beacon  was  used  to  track  the  tagged 
animal  from  an  observation  platform  (the  sailboat  lolaire  or  the  Norwegian  research  vessel 
Sverdmp)  and  recover  the  tag  once  it  released  after  a  pre-programmed  period  of  time.  The 
data  were  offloaded  using  an  infrared  interface,  checked  for  errors,  backed  up  onto  CD  and 
then  cleared  from  memory  to  allow  re-deployment  of  the  tag.  A  total  of  31.8  hours  of 
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recordings  were  made  from  13  animals  (one  of  the  recordings  did  not  contain  anv 
vocalizations).  In  addition,  calls  produced  by  resident  and  transient  killer  whales  in  the 
Pacific  Northwest  were  recorded  with  towed  hydrophone  arrays  and  single  hydrophones, 
respectively  (see  Miller  &  Tvack,  1998;  Deecke  et  al.,  2005  for  data  collection  and  processing 
details)  and  were  kindly  provided  for  the  analysis  here. 

5.3.2  Call  type  assignments 

Initially,  all  recordings  were  audited  manually  to  listen  to  and  flag  every  vocalization. 
Calls  of  either  sufficient  signal-fo-noise  ratio  (SNR,  generally  10  dB)  or  spcctrographic 
visibility  were  used  for  classification  purposes.  Categorization  of  calls  to  type  by  visual 
inspection  of  spectrograms  was  used  here  since  earlier  studies  using  this  approach 
demonstrated  high  inter-observer  reliability-  scores  and  compared  favorably  to  automated 
approaches  involving  neural  networks  (Bain,  1986;  Ford,  1991;  Deecke  ct  al.,  1999;  Yurk  et 
al.,  2002;  Deecke  &  Janik,  2006).  Three  observers  and  I  each  sorted  spectrograms  of  calls 
from  each  recording  into  our  own  sets  of  categories.  Call  types  were  derived  for  each 
recording  from  spectrogram  sets  that  were  identically  classified  by  me  and  at  least  two  of  the 
observers  (see  Table  5.1  for  counts). 

Because  calls  from  different  tag  recordings  assigned  to  the  same  type  were  found  by 
visual  inspection  to  be  more  similar  than  calls  assigned  to  different  types,  call  types  were 
subsequently  compared  across  whales  to  determine  global  classes,  matching  them  whenever 
possible  to  pre-existing  call  type  labels  (Strager,  1993;  Van  Opzecland  ct  al.,  2005).  Strager 
(1993)  labeled  the  first  34  categories  with  the  initial  “N”  for  “Norwegian”  followed  by  a 
whole  number  to  indicate  the  call  type  (i.e.,  N1  to  N34).  Subtypes  were  designated  by  a 
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54 

oo05_322b 

120 

20 

50 

48 

34 

14 

341 

23 

oo05_324a 

41 

4 

5 

6 

5 

7 

70 

12 

oo06_313s 

113 

16 

19 

14 

17 

13 

139 

38 

oo06_314a 

126 

6 

11 

9 

14 

7 

196 

13 

oo06_3 1 4s 

150 

7 

18 

13 

16 

16 

220 

78 

oo06_317s 

35 

3 

8 

8 

7 

4 

13 

23 

oo06_324s 

18 

3 

6 

4 

3 

4 

5 

3 

oo06 327s 

258 

19 

38 

35 

24 

11 

250 

51 

total 

2331 

3530 

487 

Table  5.1.  Tabulation  of  call  and  type  counts  from  each  recording  used  for  visual  sorting  and  the  subsequent 
classification  experiments.  The  number  of  spectrograms  used  to  do  the  sorting  is  listed  in  the  second  column. 
The  next  set  of  4  columns  contains  the  call  type  counts  determined  by  each  of  the  observers.  Agreement 
between  me  and  at  least  two  of  the  observers  on  a  category  defined  it  as  a  call  type  and  the  number  of  types 
determined  from  each  recording  is  listed  in  the  next  column.  The  numbers  of  stereotyped  and  variable  calls 
actually  used  in  the  syntax  analysis  are  included  in  the  hist  two  columns  of  the  table.  Some  calls  were  not  used 
because  they  were  too  faint.  The  last  row  sums  these  counts  over  all  the  recordings  to  yield  the  total  amount  of 
data  analyzed. 


number  in  the  tenths  place  (e.g.,  N23.2).  Additional  call  types  from  N35  to  N63  were  added 
subsequently  by  Van  Opzeeland  et  al.  (2005).  New  categories  identified  here  were  assigned 
new  numbers  (N64  to  N103);  spectrograms  of  the  call  types  used  can  be  found  in  Appendix 
2.  For  this  analysis,  I  assume  that  the  initial  call  type  assignments  made  here  were  correct. 
Single  instances  of  a  call  that  could  not  be  classified  into  a  class  were  considered  variable 
calls  (1 1.6%  of  the  Norwegian  calls  considered  in  this  study).  The  calls  from  the  resident 
and  transient  killer  whales  in  the  Pacific  Northwest  had  already  been  sorted  to  type. 


5.3.3  Contour  tracing 

Contour  tracing  was  restricted  to  killer  whale  calls  with  evident  onset  and  offset 
(ensuring  representation  by  a  complete  trace  compared  with  calls  that  faded  in  or  out  in  the 
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spectrogram)  and  whose  contour  was  entirely  visible  (i.e.,  free  of  vocal  activity  or  surfacing 
noises  that  overlapped  and  obscured  the  call).  A  pitch  tracking  algorithm  developed  for 
human  telephone  speech  that  relies  on  the  harmonic  structure  of  a  vocal  signal  (Wang  & 
Seneff,  2000;  Wang,  2001)  was  used  to  trace  the  fundamental  frequency  of  both  the  low  and 
high  frequency  components  (LFC  and  HFC,  respectively)  of  killer  whale  calls  when  present. 
Three  separate  parameter  settings  were  used  to  optimize  traces  for  LFCs  below  600  Hz, 
LFCs  between  400  -  4000  Hz  and  HFCs  between  4  -  12  kHz  (Wang  &  Shapiro,  In  prep). 
Using  a  customized  Matlab  interface,  hi  pitch  contours  were  checked  manually  against  the 
original  spectrograms  and  if  necessary,  portions  were  smoothed  via  linear  interpolation, 
corrected  for  pitch  doubling  or  halving  errors  and  re-traced  using  the  mouse.  Each  contour 
was  further  prepared  by  manually  removing  any  sharp  and  noisy  onset  or  offset  transients 
that  could  not  be  reliably  traced,  rejecting  LFCs  and  HFCs  that  were  too  faint  to  discern 
completely,  standardizing  the  number  of  segments  (see  below)  according  to  call  type  and 
thresholding  the  allowable  proportion  of  non-tonal  (and  therefore  non-traceable)  contour 
according  to  call  type  segment. 

Research  on  birdsong,  killer  whale  calls  and  human  speech  have  established  syllable 
boundaries  at  intervening  periods  of  silence  and  abrupt  spectral  shifts  (see  Introduction). 
Killer  whale  calls  were  divided  into  segments  and  traced  non-continuously  based  on  these 
rules  (Figure  5.2).  The  contour  traces  of  all  calls  used  here  are  inventoried  in  Appendix  3. 
The  LFC  and  HFC  of  a  call  were  often  divided  into  segments  at  different  time  points. 

5.3.4  Segmentation  algorithm 

Several  calls  contained  quick  yet  spectrally  continuous  frequency  changes  (i.e.,  the 
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N9.2 


N72.3 


rime  (s) 


Figure  5.2.  Abrupt,  non-continuous  spectral  shifts  (left:  N9.2)  or  intervals  of  silence  (right:  N72.3)  in  the 
contour  were  considered  boundaries  defining  segment  edges  (marked  here  with  black  dotted  lines). 


N16.1  N32  N91 


time  (s) 

Figure  5.3.  Call  types  whose  LFCs  were  segmented  according  to  the  algorithm  described  in  the  text. 
Segmentation  divided  the  LFCs  of  these  calls  into  two  (N16.1,  N32)  or  three  (N91)  segments. 


LFCs  of  N16.1,  N32,  N91,  Figure  5.3).  These  transitions  were  also  used  to  separate  the  calls 
into  segments  but  it  was  less  obvious  where  the  divisions  should  be  made.  A  segmentation 
algorithm  was  developed  to  divide  the  contours  with  these  breakpoints  consistently  into  a 
certain  number  of  segments  according  to  call  type  (Figure  5.4).  First,  a  Legendre  polynomial 
curve  of  4th-order  was  fit  to  the  contour  (Aburdene  &  Dorband,  1996).  Although  any 
polynomial  family  would  have  performed  similarly,  the  class  of  Legendre  polynomials  was 
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Figure  5.4.  Contour  trace  of  N32  call  (see  Figure  5.3  for  sample  spectrogram)  in  black  (without  the  noisy 
onset)  with  4th-order  Legendre  polynomial  fit  of  entire  trace  superimposed  in  red.  The  segmentation  algorithm 
first  located  the  time  point  of  maximum  deviation  between  the  actual  contour  and  fitted  polynomial  (blue 
dotted  line).  Splitting  the  contour  into  two  at  this  location  and  then  scanning  forwards  and  backwards,  the 
time  point  that  minimized  the  deviation  in  fit  between  the  two  new  segments  (heavy  black  lines,  0.05s  to  either 
side  of  the  split)  and  their  individual  polynomial  fits  (magenta  lines)  was  used  as  the  location  of  the  actual 
segmentation  split  of  the  contour  (green  dotted  line). 


selected  because  of  its  precedence  in  human  speech  research  (i.e.,  used  to  characterize  the 
tones  of  Mandarin  Chinese:  Chen  &  Wang,  1990;  Wang  &  Seneff,  1998;  Wang,  2001). 

For  a  contour  being  split  into  two  (e.g.,  N16.1,  N32),  the  time  point  of  the  maximum 
difference  between  the  contour  and  the  polynomial  was  computed.  Using  this  as  an  initial 
breakpoint,  two  new  4th-order  Legendre  polynomials  were  fit  to  the  contour  traces  preceding 
and  following  it.  (This  polynomial  order  was  selected  because  of  precedence  in  the  literature 
(Chen  &  Wang,  1990;  Wang  &  Seneft,  1998;  Wang,  2001).  See  Discussion  for  further 
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elaboration  on  the  tradeoffs  of  this  decision.)  The  breakpoint  itself  was  exempted  from 
tracing  by  excluding  0.05s  to  either  side  of  it,  the  approximate  duration  of  this  transition. 

The  algorithm  then  slid  this  breakpoint  forwards  and  backwards  by  increments  of  0.1s  (i.e., 
the  sampling  interval  of  the  pitch  tracking  algorithm)  until  the  deviation  between  the  two 
resulting  contour  segments  and  two  new  polynomial  fits  was  minimized.  This  minimum  was 
taken  as  the  segmentation  point.  For  a  contour  being  split  into  three  (i.e.,  N91),  both  the 
maximum  and  second  highest  difference  between  the  whole  contour  and  the  fitted 
polynomial  were  determined.  Fixing  one,  the  second  was  shifted  forwards  and  backwards 
until  the  deviation  between  the  three  resulting  segments  and  polynomial  fits  was  again 
minimized.  The  second  was  then  fixed  at  this  point  and  the  original  was  shifted  forwards 
and  backwards  until  a  new  minimum  between  the  traces  and  fits  was  located.  This 
procedure  was  iterated  until  the  deviation  between  the  three  contour  segments  and  the  three 
new  polynomial  fits  was  globally  minimized. 

A  final  manual  review  of  all  automated  segmentation  decisions  was  conducted  to 
discard  the  calls  containing  erroneous  divisions  (22%  of  the  total)  from  all  3  experiments 
(see  next  section),  generally  resulting  from  discontinuities  elsewhere  in  the  pitch  track  or  an 
overly  gradual  change  in  the  trace  between  the  segments.  For  the  remaining  78%,  the  two  or 
three  segments  were  separated  from  one  another  by  non- traced  transition  regions  of  0.1s. 
RMS  values  were  calculated  between  the  difference  in  the  trace  and  curve  fit  of  every 
contour  before  the  segmentation  algorithm  was  imposed.  This  term  provided  a 
measurement  of  the  extent  of  frequency  modulation,  since  the  I  xgendre  polynomial  fit 
highly  modulated  contour  shapes  with  numerous  inflection  points  more  poorly,  leading  to  a 
greater  RMS  value.  I  expected  that  contours  selected  for  segmentation  with  this  algorithm 
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Figure  5.5.  RMS  error  difference  between  contour  traces  and  polynomial  fits  for  the  three  calls  on  which  the 
segmentation  algorithm  was  run  (top  panels)  and  for  all  other  low  and  high  frequency  components  (bottom 
panels).  Each  of  the  top  distributions  was  significantly  different  from  each  of  the  bottom  distributions  (Mann- 
Whitney  U  test,  P  <  0.0()1). 


would  be  characterized  by  greater  frequency  modulation  than  the  remaining  contours.  To 
verify  and  justify  this  claim,  a  Mann- Whitney  U  test  was  conducted  to  compare  the 
distribution  of  RMS  values  of  the  three  segmented  call  types  to  those  of  the  non-segmented 
call  types.  The  distributions  were  significantly  different  (P  <  0.01)  and  are  plotted  in  Figure 
5.5  for  purposes  of  graphical  comparison. 
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5.3.5  Call  classification  experiments 

Phonemic  classification  schemes  have  yielded  better  and  more  efficient  results  in  human 
speech  recognition  tasks  than  larger  whole  word  divisions  (see  Introduction).  In  particular, 
mixed  Gaussian  models  have  achieved  robust  performance  when  characterizing  the 
differences  between  phonemes  in  human  speech  (e.g.,  Bonafonte  et  al,  1996;  Schultz  & 
Waibel,  2001).  Based  on  these  observations,  a  set  of  experiments  was  designed  to  evaluate 
whether  describing  the  killer  whale  repertoire  using  syllables  categorized  calls  at  least  as 
efficiently  as  more  traditional  whole  call  type  divisions.  This  whole  call  type  approach  was 
the  standard  against  which  the  other  models  of  representation  were  compared.  Equivalent 
or  improved  efficiency  of  these  alternatives  was  needed  to  justify  their  ability  to  characterize 
the  repertoire  in  a  manner  consistent  with  earlier  studies  and  to  parse  this  repertoire  more 
parsimoniously  using  less  information.  The  experiments  employed  here  involved 
summarizing  call  type  or  segment  features  using  mixed  Gaussian  models.  Only  call  types 
with  at  least  10  exemplars  were  considered;  this  reduced  the  total  number  of  call  types  from 
62  to  31.  Figure  5.6  summarizes  these  three  experiments  graphically. 

5. 3. 5. 1  Whole  contour  experiment  (If7 CE ) 

This  experiment  operated  under  the  classic  view  that  considered  the  entire  call  type 
to  be  the  basic  unit  of  killer  whale  vocal  production.  A  4u'-order  Legendre  polynomial  was 
fit  to  the  entire  LFC  and/ or  HFC,  linearly  interpolating  between  segments  separated  by 
silent  intervals.  This  interpolation  permitted  the  entire  call  to  be  represented  continuously, 
similar  to  how  a  human  sorting  spectrograms  might  consider  the  vocal  and  silent  pieces 
together  when  evaluating  whole  calls.  Six  parameters  that  characterized  the  properties  of 
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Figure  5.6.  Schematic  illustration  of  polynomial  and  labeling  treatment  of  contours  for  each  of  the  three 
experiments  described  in  the  text.  The  top  row  depicts  the  original  traces  for  the  low  frequency  components 
of  3  different  call  types:  N  12.1,  a  continuous  and  descending  vocalization;  NT  6.1,  a  continuous  call  subjected  to 
the  segmentation  algonthm  due  to  its  rapid  internal  frequency  change  marked  by  the  thin  dotted  line;  N72.2,  a 
call  containing  two  segments  separated  by  a  brief  period  of  silence.  The  second  row  plots  the  4th-order 
Legendre  polynomial  fits  used  in  the  first  whole  contour  experiment  (\VCE)  in  orange.  Each  contour  was 
considered  continuous  and  silent  intervals  were  interpolated  over  (indicated  by  the  thick  dotted  line  connecting 
the  two  N72.2  segments).  The  contours  were  labeled  by  adding  zeros  until  two  places  to  the  right  of  the 
decimal  point  were  filled.  The  unshared  segment  experiment  (USE)  is  shown  in  the  third  row.  Here,  the 
segments  of  each  call  were  honored  with  a  polynomial  representation  (NT  6.1  mid  N72.2  were  divided  into  two 
segments  using  the  segmentation  algorithm  and  silent  interval,  respectively)  but  were  labeled  distinctly.  The 
hundredths  place  in  the  label  was  used  to  count  each  successive  segment  for  a  particular  call  type.  The  final 
row  demonstrates  treatment  according  to  the  shared  segment  experiment  (SSE).  The  segmentation  decisions 
and  polynomial  fits  were  the  same  as  in  the  USE  but  the  labeling  allowed  call  types  to  share  syllables.  l;or 
example,  the  second  segments  of  both  N16.1  and  N72.2  were  collapsed  into  syllable  category  4.  See  the  text 
for  a  description  of  the  syllable  collapse.  Segments  that  appeared  only  in  a  single  call  retained  their  USE 
label  (e.g.,  12.11). 


each  contour  were  calculated:  (1)  the  duration  provided  temporal  information,  (2  -  5)  the 
first  4  Legendre  coefficients  represented  the  basic  spectral  shape  and  (6)  the  RMS  error 
between  the  polynomial  curve  and  the  actual  trace  captured  the  extent  of  frequency 
modulation.  Deriving  6  features  from  the  fundamental  frequency  alone  represented  a  radical 
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oversimplification  of  the  spectrogram  since  this  consolidation  lacked  information  about  the 
energy,  harmonics  and  other  details  of  the  original  signal.  In  human  speech,  for  example, 
the  intelligible  properties  of  vowels  are  contained  in  the  harmonics.  Such  information  in  the 
killer  whale  calls  might  have  been  lost  using  the  curve  fits  employed  here  but  the  goal  was  to 
provide  as  simple  a  rendering  as  possible  in  this  first  treatment  of  the  dataset. 

For  each  call  type,  a  training  set  was  formed  from  a  randomly  selected  90%  of  its 
contour  traces.  This  90-10  split  allowed  the  training  set  to  be  formed  from  the  majority  of 
the  dataset  while  leaving  a  sufficiently  large  test  set  to  assess  categorization  efficiency  (see 
Heuber  et  al,  2007  for  discussion  about  the  stability  of  such  leave-one-out  approaches; 
Elisseeff  &  Pontil,  2002  for  a  human  speech  application).  A  multi-normal  distribution  was 
then  defined  for  each  call  type  by  computing  the  mean  vector  and  covariance  matrix  of  the  6 
features  of  its  corresponding  training  set.  All  of  the  remaining  10%  of  the  calls,  the  test  set, 
were  subsequently  assigned  to  a  call  type  by  selecting  the  class  whose  training  set  data  yielded 
the  highest  multi-normal  probability  density.  The  calls  assigned  to  the  training  and  test  sets 
were  preserved  across  the  three  experiments. 

A  test  contour  containing  a  LFC  alone  was  compared  against  the  LFC  of  all  contours 
that  had  a  LFC  only  or  a  LFC  and  HFC.  A  test  contour  containing  a  HFC  alone  was 
compared  against  the  HFC  of  all  contours  that  had  a  HFC  only  or  a  LFC  and  HFC.  Finally, 
a  test  contour  containing  a  LFC  and  HFC  was  compared  against  only  those  contours 
containing  both  as  well.  In  this  last  case,  the  multi-normal  probability  density  of  the  LFC 
evaluation  was  added  to  that  of  the  HFC  evaluation  and  the  call  type  yielding  the  maximum 
sum  constituted  the  matching  class.  This  density  summation  was  considered  a  mixed 
Gaussian  model.  In  general,  for  this  and  all  experiments,  classification  was  aided  by  only 
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considering  as  possible  matches  those  call  types  that  were  characterized  by  the  same 
component  assignments  (just  described)  and  number  of  segments  (see  USE  below)  as  the 
test  contour.  All  three  experiments  were  assessed  based  on  how  often  an  incorrect  call  type 
was  chosen  for  each  member  of  the  test  set.  Results  were  reported  as  error  rates  ±  the 
standard  error  (s.e.),  which  was  computed  as: 

«. = ^ 

where p  was  the  error  rate  and  N  was  the  sample  size  of  the  test  set.  The  standard  error 
calculation  allowed  comparison  of  the  scoring  between  the  three  experiments  to  provide  an 
indication  of  improved,  weakened  or  equivalent  performance. 

5. 3.5. 2  Unshared  segment  experiment  (USE) 

This  experiment  operated  under  the  alternative  view  that  call  types  were  composed 
of  different  segments  but  did  not  allow  multiple  calls  to  share  the  same  segment.  Even- 
segment  type  (N  ~  62)  was  therefore  considered  individually  and  formed  its  own  Gaussian 
model.  For  calls  with  multiple  segments,  the  multi-normal  probability  density  was  computed 
for  each  segment  and  summed;  the  maximum  sum  determined  the  matching  class.  In  a 
second  scoring  for  this  experiment,  performance  was  also  rewarded  for  a  selected  call  type 
match  that  contained  a  combination  of  the  same  phonemic  classes  (see  SSE)  as  the  correct 
call  type  (i.e.,  confusions  with  segments  from  other  calls  but  with  the  same  phonemic  label 
were  forgiven).  The  interv  ening  0.1s  spans  between  segments  determined  by  the 
segmentation  algorithm  were  ignored  by  the  USE  and  SSE. 
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5. 3.5. 3  Shared  segment  experiment  (SSE ) 

This  experiment  operated  under  the  alternative  view  that  calls  were  composed  of 
different  segments  that  may  have  been  shared  across  type.  These  segments  were  collapsed 
into  phonemic  classes  by  consolidating  syllables  that  confused  with  one  another  when 
separated  from  their  call  type  of  origin  and  sorted,  resulting  in  26  syllables.  The  mean  vector 
and  covariance  matrix  were  computed  for  each  shared  mid  call-specific  phonemic  class  to 
generate  the  Gaussian  models.  Although  each  call  type  was  defined  by  a  unique 
combination  of  syllables,  certain  LFC  or  HFC  segment  sequences  were  occasionally  shared 
across  types  (see  Results  and  Discussion).  Certain  call  types  had  exemplars  that  contained 
both  a  LFC  and  HFC  but  had  entries  in  the  test  set  that  contained  traces  of  the  LFC  or  HFC 
alone  because  the  SNR  of  the  other  component  was  too  low  for  accurate  pitch  tracking. 
Because  of  the  inability  to  resolve  the  confusion  in  favor  of  a  correct  assignment  arising 
from  these  overlapping  syllable  combinations,  these  test  set  entries  were  considered  to 
match  incorrectly  during  the  evaluation  for  this  experiment. 


5.3.6  Rarefaction 

As  each  of  the  13  recordings  was  added  to  the  analysis,  the  rate  at  which  new 
syllables  accumulated  was  examined  as  a  function  of  the  rate  at  which  new  call  types  were 
contributed  to  the  repertoire.  The  mean  number  of  call  types  (or  syllables),  s ,  observed  in  a 
sample  of  m  individuals  was  given  by: 


*(m-lA  I(m\ 

j(»)  =  fC-£  J 


y=iv 


m 


\m  J 


where  K  was  the  total  number  of  call  types  (or  syllables),  M  was  the  total  number  of 
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individuals  and  I .  was  the  number  of  individuals  in  which  call  type  j  occurred  (analogous  to 
certain  formulations  of  biodiversity  discover}’  curves:  see  Solow  &  Smith,  1991).  For  the 
data  here,  K—  31  call  types  or  K  —  26  syllables  and  At  —  14  animals. 

5.3. 7  Variable  Norwegian  killer  whale  calls  and  stereotyped  Pacific  Northwest 
resident  and  transient  killer  whale  calls 

Variable  Norwegian  calls  and  stereotyped  calls  from  resident  and  transient  animals  in 
the  Pacific  Northwest  were  evaluated  identically  and  will  be  referred  to  collectively  as 
“alternative  calls.”  These  calls  were  separated  into  segments  based  on  intervening  penods  of 
silence  (described  in  section  5.3.3).  The  segmentation  algorithm  was  applied  to  rill  variable 
call  traces  with  an  RMS  value  exceeding  99%  of  the  RMS  values  of  stereotyped  LFC  and 
HFC  contours  that  were  not  processed  with  the  segmentation  algorithm  (bottom  two 
histograms  in  Figure  5.5).  Many  of  these  contours  did  not  contain  an  obvious  breakpoint 
for  segmentation  (due  to  substantial  frequency  modulation,  for  example).  The  resulting 
divisions  were  inspected  visually  and  only  reasonable  segmentation  decisions  were  retained 
(24%  of  the  total).  The  other  contours  were  included  as  single,  non- segmented  traces.  Each 
of  the  26  Norwegian  stereotyped  phonemic  classes  was  defined  by  a  set  of  self-sorted 
density  values  resulting  from  correct  pairings  of  training  and  test  set  members  (see 
desenption  of  these  syllable  pairings  and  the  collapse  technique  in  the  SSH  section  above). 
Each  alternative  call  segment  was  characterized  by  the  same  6  measures  presented  in  section 
5.3.5  above  and  classified  to  the  Norwegian  stereotyped  phonemic  class  that  returned  the 
largest  multi-normal  probability  density.  Assessment  was  conducted  by  tabulating  the 
proportion  of  self-sorted  density  values  of  this  matching  class  that  was  exceeded  by  the 
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density  value  of  the  alternative  call.  If  this  density  value  of  the  alternative  call  were  greater 
than  25%  of  the  self-sorted  values,  for  example,  this  would  be  considered  a  match  at  the 
25%  self-sorted  threshold. 

5.4  Results 

Of  the  3696  calls  traced,  3530  belonged  to  31  call  types  (16  with  LFC  only,  3  with 
HFC  only  and  12  with  both  LFC  and  HFC)  containing  at  least  10  exemplars.  By  excluding 
some  5.3%  of  the  dataset  belonging  to  call  types  with  9  or  fewer  samples,  certain  syllables 
may  have  been  missed  but  an  insufficient  number  of  traces  would  have  been  available  to 
summarize  these  classes  quantitatively.  In  addition  to  tallying  call  type  counts,  Figure  5.7 
accumulates  a  count  of  the  stereotyped  call  type  categories  used  in  this  analysis  according  to 
the  number  of  calls  that  were  assigned  to  each  type.  The  31  call  types  provided  the  basis  of 
comparison  for  the  whole  contour  experiment  (WCE)  whose  treatment  of  the  dataset 
paralleled  traditional  whole  spectrogram  sorting.  Table  5.2  tallies  the  number  of  call  types 
and  contour  traces  that  were  divided  into  segments  based  on  intervening  periods  of  silence 
and  quick  spectral  shifts.  The  unshared  segment  experiment  (USE)  assumed  that  each  call 
type  was  composed  of  a  set  of  distinct  segments  that  were  unshared  across  type.  A  total  of 
62  different  segments  (39  LFC  segments  and  23  HFC  segments)  was  considered  by  the  USE. 
The  shared  segment  experiment  (SSE)  allowed  call  types  to  share  syllables,  reducing  this 
number  to  26  different  syllables  (19  LFC  syllables  and  7  HFC  syllables).  Naturally,  LFC 
syllables  tended  to  occupy  a  lower  frequency  range  than  HFC  syllables.  The  syllables  also 
varied  in  shape  and  duration  and  were  labeled  numerically  in  the  order  that  they  were 
identified. 
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Figure  5.7.  Top:  Number  of  contours  of  each  call  type  used  in  the  3  experiments  described  in  the  text. 
Bottom:  Cumulative  plot  of  the  number  of  call  types  according  to  the  number  of  calls  associated  with  each 
type. 


The  categorization  efficiency  of  all  three  experiments  was  equivalent  (error  rates  + 
standard  error  for  WCE:  0.079  ±  0.014;  USE  first  scoring:  0.093  ±  0.015;  USE  second 
scoring:  0.091  ±  0.015;  SSE:  0.071  ±  0.014).  The  distributions  of  the  error  rates  formed 
from  100  runs  of  these  experiments  overlapped  (Figure  5.8).  In  other  words,  representing 
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component 

number  of  segments 

number  of  call  types 

number  of  contour  traces 

LFC 

2 

7 

775 

3 

2 

161 

HFC 

2 

8 

309 

Table  5.2.  Counts  of  call  types  and  contour  traces  affected  by  segmentation  (see  sections  5.3.3  and  5.3.4  in 
Materials  &  Methods). 


self-sorted 
threshold  (%) 

Norwegian 
variable  calls 

resident 

stereotyped  calls 

transient 
stereotyped  calls 

5 

72.4 

489 

30.7 

59 

56.0 

116 

10 

53.5 

361 

20.3 

39 

39.6 

82 

25 

26.2 

177 

3.6 

7 

11.6 

24 

50 

10.8 

73 

0 

0 

2.9 

6 

75 

4.0 

27 

0 

0 

0.0 

0 

90 

1.9 

13 

0 

0 

0.0 

0 

95 

1.1 

7 

0 

0 

0.0 

0 

Table  5.3.  Performance  of  alternative  calls  when  evaluated  against  self-sorted  Norwegian  stereotyped  call  data. 
The  first  column  in  the  table  lists  the  self-sorted  threshold  that  the  alternative  call  needed  to  exceed  to  qualify 
as  a  match  (see  Section  5.3.7).  For  each  alternative  category  heading  (i.e.,  Norwegian  variable  calls  and  Pacific 
Northwest  resident  and  transient  stereotyped  calls),  the  first  and  second  columns  contain  the  percent  and  count 
of  calls,  respectively,  that  matched  one  of  the  Norwegian  stereotyped  syllables  at  the  performance  threshold. 


stereotyped  calling  behavior  in  terms  of  whole  calls,  unshared  syllables  and  shared  syllables 
all  provided  equally  strong  categorization  results.  The  classification  of  whole  call  types  by 
human  judges  was  the  standard  against  which  these  efficiencies  were  scored.  The  discovery 
curves  in  Figure  5.9  revealed  that  the  numbers  of  both  call  types  and  syTllables  apparently 
saturated  after  the  first  7  or  8  animals  were  considered  (top  panels).  The  eventual  rate  at 
which  syllables  accumulated  was  lower  than  that  of  call  types. 

Results  of  the  Norwegian  variable  calls  (N  =  576  calls;  675  segments)  and  Pacific 
Northwest  resident  (N  —  192  calls  and  segments  since  each  call  only  had  a  single  segment) 
and  transient  (N  =162  calls;  207  segments)  stereotyped  calls  against  the  self-sorted 
Norwegian  stereotyped  calls  are  presented  in  Table  5.3.  The  values  in  this  table  correspond 
to  the  percentage  (or  number)  of  alternative  calls  whose  probability7  density7  score  associated 
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Figure  5.8.  Histograms  of  error  rates  for  each  experiment  after  100  runs.  Call  contours  were  categorized  to 
type  based  on  the  entire  call  type  (upper  left),  unshared  segments  (bottom  row)  or  a  combination  of  shared  and 
call-specific  segments  (upper  right).  The  unshared  segments  were  scored  strictly  (i.e.,  each  segment  within  a 
call  type  had  to  match  to  itself  for  the  call  to  succeed,  lower  left)  and  more  leniently  (i.e.,  a  call  could  also 
succeed  if  each  of  its  composite  segments  matched  to  segments  belonging  to  the  same  phonemic  classes,  lower 
right). 


with  its  best  Norwegian  syllable  match  exceeded  a  threshold  calculated  from  a  density 
distribution  of  this  Norwegian  syllable  when  it  correctly  matched  to  its  own  model.  The 
variable  calls  outperformed  both  sets  of  Pacific  Northwest  calls  and  the  transient  calls 
showed  better  performance  than  resident  calls.  At  the  25%  self-sorted  threshold  (see 
Section  5.3.7),  84%,  43%  and  96%  of  the  variable,  resident  and  transient  segments, 
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Figure  5.9.  Rarefaction  curves  plotting  the  number  of  call  types  (top  left)  and  shared  syllables  (top  right)  as  a 
function  of  the  number  of  whales  considered.  Bottom:  Normalized  number  of  shared  syllables  plotted  against 
normalized  number  of  call  types  and  viewed  for  all  data  (left)  and  zoomed  region  (right,  magnified  to  box 
depicted  on  left). 


respectively,  matched  with  one  of  the  shared  syllables  from  the  stereotyped  Norwegian 
repertoire.  Figure  5.10  plots  a  sampling  of  the  best  matches  for  each  alternative  call  category 
superimposed  upon  its  corresponding  syllable  class. 

After  condensing  the  number  of  syllables  required  to  represent  the  vocal  repertoire,  I 
examined  how  many  call  types  drew  on  this  shared  pool  and  what  their  patterns  of 
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Figure  5.10.  Sampling  of  marches  of  alternative  calls  (dark  black  traces)  with  Norwegian  stereotyped  call 
syllable  matches  (gray  traces).  The  alternative  calls  included  Norwegian  variable  (first  column).  Pacific 
Northwest  resident  (second  column)  and  transient  call  segments  (third  column)  that  performed  at  threshold 
percentages  (see  Table  5.3)  of  90%,  25%  and  25%,  respectively.  The  title  of  each  sub-panel  lists  the  number  of 
the  syllable  match  and  whether  the  match  was  a  LFC  or  HFC.  The  LFC  or  HFC  label  also  refers  to  the 
component  of  the  Pacific  Northwest  calls  and  the  titles  in  these  sub-panels  conclude  with  the  call  t\pe  of  the 
resident  or  transient  call.  The  number  following  the  hyphen  in  the  transient  call  label  refers  to  the  segment 
number. 


occurrence  were.  Fifteen  syllables  appeared  in  only  one  call  type.  The  remaining  7  LFC  and 
4  HFC  syllables  were  formed  from  collapsing  the  segments  from  at  least  two  different  call 
types  (Figure  5.11).  Table  5.4  counts  the  number  of  call  types  (altogether  and  separated 
based  on  presence  or  absence  of  LFC  and/or  HFC)  composed  entirely  of  shared  syllables,  a 
mixture  of  shared  and  call-specific  syllables  and  entirely  of  call-specific  syllables.  All  but  one 
of  the  shared  LFC  syllables  constituted  the  entire  LFC  in  at  least  one  call  type.  Three  of  the 


syllable  1,  HFC  syllable  2,  HFC  syllable  3,  LFC  syllable  4,  LFC 


Figure  5.11a. 
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Figure  5.11b.  a:  Syllables  found  in  at  least  two  call  rvpes.  Note  different  axis  scales,  b:  Color  code  for  syllable  traces  of  different  call  type  origins  (legend 
locations  here  match  trace  plot  locations).  The  digits  before  the  decimal  point  and  the  first  digit  after  the  decimal  point  refer  to  call  type  (e.g.,  N8,  N76.2, 
etc.).  The  second  digit  after  the  decimal  point  corresponds  to  the  segment  number  in  the  call.  (See  f  igure  5.6  for  further  elaboration.)  Certain  numbers  are 
repeated  in  this  figure  bur  this  ambiguity  is  resolved  when  differentiating  between  the  LFC  and  HFC  indicated  in  the  titles  of  the  sub-panels  of  the  figure. 


call-specific  syllables 

syllable  mixture 

shared  syllables 

all  call  types 

8 

5 

18 

LFC  &  HFC 

2 

2 

8 

LFC 

6 

3 

7 

HFC 

0 

0 

3 

Table  5.4.  Counts  of  all  call  types  (first  row),  call  types  containing  both  a  LFC  and  HFC  (second  row),  a  LFC 
alone  (third  row)  and  a  HFC  alone  (fourth  row)  that  were  comprised  of  call-specific  syllables  only,  a  mixture  of 
shared  and  call- specific  syllables,  and  shared  syllables  only. 

seven  LFC  syllables  arose  in  multiple  call  types  at  the  same  position  in  a  series  (i.e.,  first  or 
last)  and  two  appeared  in  multiple  call  types  at  different  positions  in  a  series. 

Now  I  will  describe  three  syllables  in  greater  detail  that  occurred  commonly  and  at  a 
reliable  location  in  a  sequence.  Syllable  7  was  produced  in  the  first  position  of  a  two-syllable 
sequence  in  two  call  types  and  never  in  the  last  position.  Of  the  13  HFCs  constructed  from 
at  least  one  shared  syllable,  12  contained  syllable  1,  syllable  2  or  both  (see  Figure  5.1 1  for 
plots  of  these  syllables).  In  fact,  4  call  types  had  HFCs  built  from  a  syllable  2  — ►  1  sequence. 
The  corresponding  LFCs  for  these  call  types  contained  syllables  that  were  all  shared  with  at 
least  one  other  call.  Syllable  1  occurred  second  in  the  only  other  instance  in  which  it  was 
paired  with  another  syllable  that  was  not  syllable  2.  Syllable  2  occurred  first  in  two  of  the 
three  other  instances  in  which  it  was  paired  with  a  second  syllable  that  was  not  syllable  1 . 

The  patterns  of  certain  call  types  seemed  to  be  most  parsimoniously  explained  by  a 
set  of  shared  and  flexibly  ordered  syllables.  Several  sets  of  call  types  could  be  described  by 
the  simple  addition  of  extra  syllables  to  a  common  base  segment  or  set  of  segments  in  the 
manner  described  in  the  Introduction.  For  example,  call  type  N72  (Figure  5.1)  was 
comprised  of  a  long  and  slowly  ascending  initial  segment  (syllable  5).  Call  types  N72.2  and 
N72.3  contained  syllable  5  followed  by  one  or  two  rapid  higher  frequency  segments  at  the 
end  (a  single  or  double  instance  of  syllable  4,  respectively).  These  three  subtypes  can  be 
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generated  easily  by  variably  arranging  or  repeating  two  LFC  syllables.  Another  example 
involves  call  type  N16.1,  which  was  built  from  two  LFC  syllables  and  two  HFC  syllables 
(Figure  5.1).  N91  was  identically  composed  except  for  an  additional  LFC  syllable  at  the  end 
(Figure  5.1).  In  fact,  the  LFCs  of  N91.2  and  N91.3  (Figure  5.1,  not  used  in  the  analysis  here 
because  they  were  recorded  fewer  than  10  dmes  each)  could  be  synthesized  by  appending 
another  one  or  two  LFC  syllables,  respectively,  to  the  end  of  an  N91. 

5.5  Discussion 

Taking  advantage  of  human  speech  processing  methods,  the  pitch  tracking 
algorithm,  segmentation  algorithm,  and  single  and  mixed  Gaussian  models  were  successfully 
adapted  and  applied  here  to  Norwegian  killer  whale  vocalizations.  Human  speech  can  be 
broken  into  words  or  phonemes  and  though  each  division  is  meaningful  from  a  production 
perspective,  the  phonemes  afford  improved  speech  recognition  algorithms  on  large 
vocabularies  in  particular  (see  Lee  et  al.,  1989)  and  are  relevant  for  our  perception  of 
consonants  and  vowels  (Furui,  1986).  Analogously,  though  the  call  type  has  been  viewed 
historically  as  the  basic  unit  of  killer  whale  stereotyped  vocal  production,  I  used  a 
classification  test  to  explore  whether  the  vocal  repertoire  could  be  parsed  and  represented 
just  as  efficiendy  using  smaller  phonological  units.  Compared  to  the  approach  of 
categorizing  whole  call  types,  a  few  pieces  of  evidence  from  this  study  support  the  notion 
that  a  syllabic  inventory  could  provide  a  set  of  simpler  shared  components  that  the  killer 
whales  use  to  assemble  at  least  some  of  their  calls.  In  particular,  1)  equivalent  classification 
efficiencies  were  achieved  when  the  sorting  was  based  on  either  the  non-segmented  whole 
call  type  or  the  syllabic  divisions,  2)  nearly  75%  of  all  stereotyped  calls  contained  at  least  one 
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syllable  shared  across  calls,  and  3)  the  set  of  stereotyped  syllables  provided  reasonable 
matches  for  many  of  the  variable  calls. 

First,  the  results  indicated  that  temporal  and  spectral  representations  of  calls  in  their 
entirety  (by  the  whole  contour  experiment  (WCE))  or  as  sets  of  either  distinct  (by  the 
unshared  segment  experiment  (USE))  or  shared  (by  the  shared  segment  experiment  (SSE)) 
syllables  achieved  similar  performance  when  classifying  these  calls  to  type  as  judged  by 
human  observers  (Figure  5.8).  All  three  experiments  provided  equivalently  robust  means  of 
characterizing  stereotyped  calling  behavior.  Compared  to  the  WCE  and  USE,  the  SSE  relied 
on  a  reduced  amount  of  information  to  perform  the  classification  task.  In  particular,  the 
same  dataset  was  compressed  into  a  smaller  number  of  polynomial  representations  for  the 
shared  syllables  in  the  SSE  (N  =  26)  versus  the  set  of  entire  call  types  (N  =  31)  or  distinct 
syllables  (N  =  62).  Because  each  unit  was  characterized  by  6  parameters,  the  SSE  required 
considerably  fewer  features  (i.e.,  156)  than  the  other  two  experiments  (186  for  WCE  and  372 
for  USE),  reducing  the  computation  time  considerably.  These  shared  syllables  still  classified 
calls  to  type  as  effectively  as  the  WCE  and  USE  characterizations,  suggesting  that  this  loss  of 
information  may  have  been  offset  by  a  simpler  and  more  efficient  alternative  system  of 
representation.  Computer  scientists  have  moved  to  a  similar  approach  for  human  speech 
because  improved  recognition  can  be  achieved  using  phonemes  instead  of  words.  (Of 
course,  there  remains  the  downstream  need  to  derive  words  from  these  assembled 
phonemes.) 

As  described  in  the  Introduction,  chunking  is  the  process  by  which  information  is 
segmented  to  relax  the  cognitive  demands  associated  with  retention  and  recall.  The  WCE 
and  SSE  proposed  that  a  killer  whale  could  construct  its  entire  stereotyped  vocal  repertoire 
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either  by  storing  a  large  number  of  whole  call  types  or  by  rearranging  a  smaller  set  of 
archived  shared  syllables,  respectively.  (The  nearly  tripled  size  of  the  syllable  count  when 
each  was  considered  distinct  instead  of  shared  rendered  the  USE-based  representation  less 
likely  from  a  chunking  perspective.  In  particular,  chunking  facilitates  recall  by  decomposing 
information  into  a  set  of  constituent  parts.  With  fewer  parts,  less  memory  must  be  devoted 
to  store  the  pieces  required  to  represent  and  understand  the  whole  repertoire.  The  SSE 
demands  only  a  third  of  the  information  used  by  the  USE,  permitting  a  more  condensed  and 
efficient  representation.)  The  SSE  approach  supports  the  viewpoint  that  killer  whale  calls 
can  be  chunked  into  fewer  and  simpler  phonological  vocal  units,  which  can  generate  the 
same  repertoire  defined  by  the  larger  and  more  complex  set  of  whole  call  type  contours  used 
in  the  WCE.  As  mentioned  above,  the  reduction  in  memory  load  afforded  by  the  smaller 
shared  syllable  count  may  be  offset  by  the  need  to  retain  the  rules  used  to  reconstitute  the 
repertoire  from  these  syllables. 

Second,  the  11  shared  LFC  and  HFC  syllables  identified  here  were  shared  and 
reordered  to  generate  a  variety  of  different  call  types  (Figure  5.1 1).  The  duration,  Legendre 
polynomial  coefficients  and  RMS  frequency  modulation  score  all  determined  which 
segments  sorted  together  and  merited  collapse  into  a  single  category.  Most  call  types  were 
built  from  at  least  one  of  the  shared  syllables,  supporting  the  notion  that  many  of  the  calls 
were  constructed  from  a  set  of  common  syllabic  units.  Indeed,  18  of  the  31  call  types  were 
comprised  entirely  of  shared  syllables  and  another  5  contained  at  least  one  shared  syllable 
(together  constituting  53%  of  the  calls,  Table  5.4).  Certain  patterns  of  syllable  usage 
emerged.  Some  syllables  were  used  much  more  frequently  than  others,  including  LFC 
syllable  4  and  HFC  syllables  1  and  2.  Distinct  call  types  resulted  when  the  same 
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arrangements  of  HFC  syllables  were  paired  with  different  combinations  of  LFC  syllables 
(and  vice  versa).  When  combined  in  tandem,  3  shared  LFC  and  2  shared  HFC  syllables  were 
produced  at  the  same  position  within  a  sequence,  consistently  beginning  or  ending  multiple 
call  types.  For  example,  when  paired,  HFC  syllable  2  always  preceded  HFC  syllable  1. 
Syllables  could  be  ordered  more  flexibly  as  well:  2  shared  LFC  and  1  shared  HFC  syllables 
were  produced  at  different  positions  within  a  sequence.  These  observations  may  form  the 
outlines  of  a  phonological  syntax-based  rule  system  in  which  syllables  are  arranged  in  certain 
orders  and  combinations  only,  but  further  work  is  required  to  test  this  hypothesis. 

The  two  examples  given  in  the  results  (i.e.,  N72,  N72.2,  N72.3  and  N16.1,  N91, 
N91.2,  N91.3)  suggest  a  system  in  which  new  call  types  can  be  generated  by  concatenating 
additional  syllables  and  interspersing  them  with  periods  of  silence  (the  former)  or  stringing 
them  together  as  a  continuous  vocalization  (the  latter).  This  study  does  not  offer  proof  that 
the  killer  whales  were  actually  creating  their  calls  in  this  manner.  (Training  captive  killer 
whales  to  synthesize  calls  by  serially  producing  components  heard  from  a  loudspeaker 
and/ or  to  decompose  playback  calls  by  producing  the  set  of  constituent  segments  would 
certainly  offer  important  supporting  evidence.)  Such  a  system  could,  however,  flexibly  yield 
the  size  and  kind  of  repertoire  produced  by  these  animals.  In  addition,  new  call  types  could 
be  fashioned  from  existing  call  types  simply  by  adding,  deleting  or  reordering  syllables.  One 
of  the  basic  patterns  witnessed  here  indicated  that  the  LFCs  could  be  formed  by  linking 
successively  longer  strings  of  syllables  together  whereas  the  dominant  HFC  syllable 
combination  was  conserved  across  multiple  call  types. 

Eight  call  types  were  constructed  entirely  from  a  set  of  unique  syllables  that  were  not 
shared  with  any  other  type.  Because  the  shared  syllables  were  formed  from  distinct 
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segments  that  confused  with  one  another  when  they  were  sorted,  additional  consolidation 
would  have  produced  an  even  smaller  number  of  unique  units  and  consequendy  a  greater 
number  of  call  types  formed  from  the  set  of  shared  units.  A  tradeoff  emerged  since  further 
syllable  collapse  conflated  previously  distinct  call  types  by  characterizing  them  with  identical 
internal  syllabic  orderings,  reducing  the  total  number  of  call  types.  This  provided  an 
effective  lower  bound  on  the  total  number  of  shared  syllables,  limiting  the  extent  to  which 
the  segments  should  be  collapsed. 

Third,  nearly  half  of  the  variable  calls  matched  a  stereotyped  syllable  with  a  score 
that  rivaled  at  least  10%  of  repeated  self-sorts  of  the  actual  syllable  (Table  5.3,  Figure  5.10). 

In  other  words,  many  of  the  variable  calls,  which  generally  have  been  investigated  separately 
from  stereotyped  calls  or  dismissed  altogether  from  analyses,  sorted  into  the  syllable 
categories  generated  from  the  stereotyped  repertoire.  This  suggests  that  variable  calls  mav 
differ  less  from  stereotyped  calls  than  previously  thought  and  that  many  variable  calls  may 
represent  different  arrangements  of  the  same  phonological  segments  as  are  found  in 
stereotyped  calls. 

The  prospect  that  killer  whales  build  their  calls  from  smaller  subunits  is  reinforced  by 
the  observation  that  compound  calls  can  be  constructed  from  whole  stereotyped  calls 
(Chapter  3;  Strager,  1993;  1995).  This  suggests  a  nested  system  of  vocal  production  in  which 
similar  rules  of  flexible  sequencing  assemble  syllables  into  call  types,  which  can  then  be 
assembled  into  compound  calls.  Certain  syllables  (e.g.,  1  and  2)  and  call  types  (e.g.,  N66  and 
N67)  participated  more  frequently  in  these  assemblages  than  others.  A  large  portion  of  the 
killer  whale  vocal  repertoire  can  be  defined  by  a  system  that  flexibly  generates  new  call  types 
from  a  finite  set  of  components  but  employs  only  a  subset  of  the  possible  combination  of 


165 


these  segments.  This  kind  of  vocal  structure  of  smaller  subunits  building  the  repertoire  is 
consistent  with  analyses  conducted  on  Alaskan  resident  and  transient  killer  whale 
stereotyped  calls  (Yurk,  2005)  and  on  humpback  whale  song  (Payne  &  McVay,  1971;  Payne 
et  ah,  1984;  Suzuki  et  ah,  2006),  suggesting  that  it  may  characterize  the  vocal  regimes  of  a 
broader  array  of  marine  mammal  populations  and  species  than  previously  anticipated. 

Generally,  a  syllable  included  a  set  of  traces  that  occupied  a  continuous  time  and 
frequency  space.  For  example,  syllable  8  was  defined  by  segments  gathered  from  three  call 
types  (N32,  N64  and  N89)  that  lasted  ~ls  and  ranged  between  500  Hz  and  1  kHz.  A  few 
syllables  were  formed  from  call  types  whose  contributions  to  the  class  segregated  noticeably, 
causing  larger  amounts  of  temporal  and/or  spectral  spread  (Figure  5.1 1).  This  was  due  in 
part  to  the  multivariate  representation  of  each  contour  since  a  syllable  category  could  have 
been  formed  from  contours  that  shared  a  subset  of  the  6  temporal  and  spectral  features  used 
to  determine  collapse  into  a  single  class.  Context  may  have  influenced  this  variability  as  well. 
Most  of  the  segments  comprising  syllable  4,  for  instance,  were  short  and  produced  either 
separately  or  between  periods  of  silence.  Segments  16.12  and  91.02,  however,  were  the 
longest  contributors  to  syllable  4  and  both  were  vocalized  without  interruption  after  an 
initial  syllable.  The  immediate  vocal  context  and  position  of  a  segment  within  a  call  type 
may  have  influenced  its  production  and  caused  some  of  the  variability  observed  in  Figure 
5.1 1  (see  Pols,  1986  for  a  human  speech  analog  in  which  vocal  context  and  transition 
influence  the  acoustic  structure  of  phonemes  and  how  they  are  perceived).  Finally,  each 
segment  was  treated  as  a  unified  set  that  was  subject  to  collapse  with  other  segments.  Both 
outliers  and  traces  that  approximated  the  mean  of  a  syllable’s  distribution  more  closely  were 
bundled  together  during  collapse.  Segment  72.33  in  syllable  4,  for  example,  was 
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characterized  by  considerable  scatter  in  terms  of  mean  frequency  (Figure  5.1 1),  possibly 
arising  from  variability  in  production  at  the  individual  or  group  level. 

A  portion  of  the  stereotyped  calls  from  Pacific  Northwest  resident  and  transient 
killer  whale  matched  successfully  with  the  Norwegian  syllables  identified  here  (Table  5.3, 
Figure  5.10).  This  is  especially  striking  considering  the  high  performance  standard  needed  to 
rank  as  well  as  the  stereotyped  Norwegian  syllables.  There  are  two  alternative  explanations 
for  this  result.  The  first  concludes  that  because  killer  whales  vocalize  within  a  finite 
temporal  and  spectral  range,  it  is  to  be  expected  that  a  certain  proportion  of  signals  will 
overlap  between  populations  by  chance.  The  second  suggests  that,  similar  to  humans,  each 
population  of  killer  whales  uses  a  portion  of  the  common  universal  phonemic  inventory  to 
form  its  own  subset  of  units  to  establish  its  vocal  repertoire.  The  lower  success  rates  of  the 
stereotyped  Pacific  Northwest  calls  compared  with  the  variable  Norwegian  calls  (Table  5.3) 
do  indicate  important  divergent  properties  between  the  populations  that  need  to  be 
considered. 

These  two  hypotheses  will  be  informed  by  further  work  on  how  similar  syllable 
usage  is  among  killer  whale  groups.  The  second  explanation,  for  example,  would  receive 
additional  support  if  the  same  syllable  were  to  undergo  the  kind  of  cultural  drift  across 
multiple  call  types  as  has  been  observed  at  the  level  of  the  call  type  (Deecke  et  al.,  2000). 
Similar  to  human  speech  and  dialect  patterns,  previous  research  has  indicated  that  killer 
whale  vocalizations  change  over  time  as  a  result  of  cultural  changes  and  copying  errors  (see 
Ford,  1991  for  an  early  discussion  of  the  issue  and  Deecke  et  al.,  2000  for  a  demonstration; 
Miller  &  Bain,  2000;  Yurk,  2005).  Cultural  drift  may  also  operate  on  the  syllables  if  they  are 
indeed  the  more  basic  units  of  vocal  production.  Just  as  different  call  types  changed  at 
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different  rates  (Deecke  et  al.,  2000),  the  temporal  or  spectral  properties  of  different  syllables 
may  become  altered  depending  on  their  pattern  or  frequency  of  usage.  Supposing  that  these 
animals  have  production  control  over  the  individual  syllables  separately,  further  work  should 
test  the  rates  and  kinds  of  syllable  modification  that  occur  over  time.  If  shared  syllables  drift 
similarly  across  call  types,  this  would  support  the  view  that  calls  are  composed  of  discrete 
units  subject  to  individual  handling.  This  idea  is  analogous  to  the  manner  in  human  language 
in  which  the  drift  in  production  of  certain  vowels  across  words  can  lead  to  regional  dialects 
and  accents.  Another  experiment  might  involve  training  captive  killer  whales  to  respond  to 
playbacks  of  a  subset  of  calls  by  producing  each  call’s  constituent  segments  in  a  series 
separated  by  brief  silence.  Their  vocal  response  to  a  new  subset  of  call  playbacks  could  then 
be  investigated.  If  the  animals  produced  the  component  segments  of  these  new  calls,  the 
experiment  would  demonstrate  that  the  animals  were  capable  of  parsing  the  vocalizations 
into  syllables  in  a  manner  consistent  with  their  training. 

This  study  introduced  a  variety  of  new  techniques  based  on  the  field  of  human 
speech  recognition  to  analyze  killer  whale  vocal  repertoires.  The  pitch  tracking  algorithm 
used  here  has  been  productively  applied  elsewhere  (Nousek  et  al.,  2006;  Miller  et  al.,  2007), 
demonstrating  its  effectiveness  in  a  variety  of  experimental  contexts.  The  Legendre 
polynomials  offered  a  simple  representation  of  the  contours  that  permitted  robust 
classification  to  the  call  type  categories  previously  defined  by  humans.  The  capacity  to 
represent  each  contour  with  only  6  data  points  marked  a  considerable  improvement  in  the 
amount  of  computation  time  and  memory  required  to  execute  classification  tasks.  The  4th- 
order  Legendre  polynomial  fit  offered  a  compromise  between  reducing  the  number  of  data 
points  required  while  still  achieving  a  fairly  accurate  fit.  It  is  likely  that  reducing  the  order  of 
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the  polynomial  further  (and  therefore  the  number  of  data  points  associated  with  each 
contour)  would  have  contributed  to  a  compromised  ability  to  discriminate  between  call  types 
in  the  experiments  presented  here.  Increasing  the  order,  however,  would  have  required 
longer  computation  time.  Although  a  higher  order  would  have  fit  the  discontinuous  traces 
slightly  better,  it  would  have  introduced  a  set  of  unnecessary  additional  coefficients  for  the 
simpler  continuous  traces  that  may  have  interfered  with  categorization.  Future  work  could 
consider  this  issue  more  closely  by  evaluating  classification  performance  across  a  range  of 
orders  of  the  polynomial  fit. 

The  3  experiments  could  only  be  compared  when  each  was  afforded  the  same 
information  to  conduct  the  classification.  All  of  the  results  were  improved  therefore  by 
constraining  the  possible  training  set  matches  according  to  the  number  of  segments  and 
components  in  the  test  contour.  If  these  constraints  were  relaxed  in  the  context  of  a 
different  study,  similar  classification  performance  results  might  be  achieved  by  using  a  higher 
order  Legendre  polynomial  fit.  In  addition,  the  Gaussian  models  provided  a  straightforward 
way  to  summarize  the  contributions  of  numerous  contours  through  a  single  mean  vector  and 
covariance  matrix.  The  speed  and  accuracy  of  these  methods  would  facilitate  real-time  call 
type  classification  and  analyses  requiring  more  elaborate  computations. 

In  the  future,  it  would  be  useful  to  explore  forming  a  full  inventory  of  phonemes 
derived  from  the  variable  calls  alone  and  investigating  the  overlap  between  this  inventory7 
and  the  stereotyped  and  variable  phoneme  inventories  of  both  Norwegian  and  other 
populations.  The  ultimate  test  for  the  legitimacy  of  the  perspective  that  syllables  are  the 
building  blocks  of  killer  whale  vocal  activity  must  come  from  the  animals  themselves. 
Playback  studies  in  captivity  could  test  whether  animals  are  capable  of  discerning  syllables  by 
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evaluating  their  performance  on  syllable  matching  and  discrimination  tasks.  Further  support 
for  syllable  sharing  would  be  offered  by  killer  whales  that,  after  being  trained  to  classify 
stimulus  pairs  as  the  same  or  different,  sorted  syllables  roughly  into  the  categories 
established  here.  This  study  offers  incentive  to  continue  exploring  syntax  in  killer  whales 
analytically  and  experimentally  to  improve  our  understanding  of  how  these  animals  perceive 
and  produce  their  vocal  repertoire. 
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Chapter  6.  Conclusions 


6.1  Framing  comments 

There  has  been  increasing  interest  in  cultural  traits  that  are  transmitted  socially 
through  observation  and  learning  in  animals.  While  social  learning  can  most  easily  be 
demonstrated  in  experiments  with  capdve  animals,  methods  for  gathering  indirect  evidence 
have  been  proposed  for  wild  animals.  One  method  suggests  that  if  ecological  and  genetic 
explanations  can  be  discounted  for  behavioral  differences  between  social  groups  or 
populations  in  the  wild,  such  differences  are  likely  to  have  arisen  via  cultural  transmission. 
For  example,  Whiten  et  al.  (1999)  examined  whether  39  different  behaviors  were  absent, 
occasionally  present  or  frequendy  present  in  6  populations  of  chimpanzees  (Pan  troglodytes) 
across  Africa.  These  authors  concluded  that  the  resulting  unique  behavioral  arrays  arose 
from  culturally-transmitted  differences  between  the  populations.  Although  it  is  not  possible 
to  exclude  genetic  or  ecological  explanations  entirely  (see  Humle  &  Matsuzawa,  2002), 
Whiten  et  al.  (1999)  certainly  brought  more  attention  to  the  discussion  of  animal  culture. 
Two  years  later,  a  lengthy  discourse  on  the  subject  of  culture  in  marine  mammals  was 
published  (Rendell  &  Whitehead,  2001).  Killer  whales  ( Orcinus  orcci)  featured  prominently  in 
this  article  and  the  ensuing  commentary  since  they  exhibit  a  combination  of  three  features 
that  make  the  cultural  acquisition  of  behavior  likely  in  this  species  (see  Chapter  1).  First, 
their  population-distinctive  foraging  strategies  resemble  the  differences  in  feeding  behavior 
between  the  chimpanzee  populations  described  above.  Second,  their  social  structure 
provides  a  set  of  stable  relationships  where  repeated  learning  and  cultural  transmission  of 
behavior  could  occur.  Finally,  killer  whales  produce  group-distinctive  vocal  repertoires  that 
parallel  the  song  repertoires  of  certain  bird  species  and  that  exceed  in  complexity  and 


171 


diversity  the  vocal  behavior  of  non-human  primates  including  chimpanzees. 

Captive  experiments  allow  a  controlled  demonstration  of  social  transmission  and 
observational  learning  (e.g.,  Helfman  &  Schultz,  1984;  Warner,  1988;  Galef,  1992;  White  et 
al.,  2007).  Such  work,  however,  tends  to  be  disconnected  from  the  functional  contexts  of 
behavior  in  the  wild,  which  can  be  difficult  to  track  continuously.  A  hybrid  approach  is 
likely  to  be  the  most  beneficial  in  which  social  learning  studied  in  captivity  is  complemented 
by  explorations  of  behavior  in  the  wild,  an  ecologically-  and  evolutionarily-valid  setting. 

From  a  functional  perspective,  it  is  important  to  understand  the  details  of  group-  or 
population-specific  behavioral  differences  as  a  starting  point  to  search  for  evidence  that  killer 
whales  engage  in  cultural  transmission  and  social  learning.  The  aim  of  my  dissertation  was 
to  focus  on  this  functional  approach  and  use  digital  archival  tags  (DTAGs,  Johnson  & 

Tyack,  2003)  to  examine  the  feeding  and  vocal  behaviors  of  Norwegian  killer  whales  more 
closely.  The  killer  whales  that  have  been  studied  in  this  population  display  carousel  feeding 
behavior  (Christensen,  1978;  Simila  &  Ugarte,  1993)  and  pod-specific  stereotyped  pulsed  call 
repertoires  (Strager,  1993;  1995).  I  explored  the  contributions  of  individual  animals  to 
carousel  feeding  groups,  the  relationships  between  individual  movements  and  group  vocal 
activity  and  the  syntax  of  pulsed  calls.  My  approach  characterizes  a  constellation  of 
behaviors  that  can  be  used  in  a  manner  similar  to  the  behavioral  array  employed  by  Whiten 
et  al.  (1999).  In  particular,  a  set  of  companion  studies  should  be  designed  in  the  future  to 
examine  the  detailed  movement,  feeding  and  vocal  behaviors  between  killer  whale 
populations.  These  inquiries  will  provide  a  foundation  for  describing  both  the  shared  and 
divergent  patterns  that  might  be  inherited  or  learned. 

Once  the  details  of  these  behaviors  are  mapped  for  at  least  a  handful  of  populations, 
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it  is  important  to  examine  their  acquisition  by  exploring  their  transmission.  To  prove  social 
learning,  it  is  first  necessary  to  document  the  frequency  and  type  of  opportunities  in  which 
naive  individuals  are  exposed  to  more  experienced  individual(s)  performing  the  behavior  in 
question.  The  crucial  set  of  experiments  would  then  involve  tracing  the  transmission  and 
execution  of  either  the  very  behaviors  that  distinguish  different  populations  of  killer  whales 
or  some  arbitrary  and  novel  behavior  used  by  a  particular  group.  The  kind  of  longitudinal 
effort  conducted  on  single  individuals  from  birth  until  they  can  perform  the  behaviors  in 
question  is  certainly  served  well  by  work  done  in  captivity.  This  would  allow  a  constant 
monitoring  of  the  social  and  physical  environment  to  ascertain  whether  and  which  external 
phenomena  contribute  to  learning  a  behavior.  Genetic  explanations  might  still  linger  if  these 
behaviors  are  transmitted  primarily  among  related  individuals.  Alternatively,  the  wild 
provides  a  more  natural  experimental  context  but  is  much  less  controlled.  It  would  be 
possible  to  introduce  a  novel  behavior  or  vocalization  (see  Richards  et  al.,  1984)  to  a  free- 
ranging  population  of  animals  and  examine  whether  and  how  easily  it  is  incorporated  by  a 
naive  individual  into  its  repertoire.  Captive  studies  would  also  permit  a  seeding  of  different 
behaviors  in  different  individuals  (e.g.,  see  Horner  et  al.,  2006;  Hopper  et  al.,  2007  for 
comparable  chimpanzee  studies),  which  would  allow  subsequent  examination  of  which  of 
these  behaviors  a  new  calf  eventually  adopts. 

Such  work  would  be  complemented  by  studies  in  the  wild  of  behavioral  acquisition 
but  it  may  be  hard  to  observe  an  individual  over  the  months  or  years  necessary,  especially 
given  the  restrictions  on  tagging  very  young  animals.  However,  a  combined  tracking  of 
social  association  with  the  dissemination  of  a  novel  behavior  or  behavioral  modification  in 
the  wild  would  allow  an  assessment  of  whether  social  learning  and  cultural  transmission 
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occur  naturally.  Such  a  process  was  documented  for  the  spreading  of  lobtail  feeding  among 
humpback  whales  ( Megaptera  novaeangliae )  in  New  England  waters  (Weinrich  et  al.,  1992). 
Therefore,  an  improved  understanding  of  the  movement  and  vocal  phenomena  as  I  have 
presented  them  in  this  dissertadon,  coupled  with  future  work  on  their  ontogeny  and 
transmission,  will  help  characterize  the  function  of  culture  and  social  learning  with  respect  to 
certain  behaviors  in  killer  whales. 

6.2  Chapter  synopses 
6.2.1  Chapter  2 

Killer  whales  display  a  wide  diversity  of  foraging  strategies  and  feeding  preferences 
(see  Chapter  1),  which  are  influenced  by  their  socially  gregarious  nature  and  the  ecology  and 
prey  availability  of  their  habitat.  In  particular,  Norwegian  killer  whales  feed  in  groups  on 
herring  ( Clupea  harengus)  via  carousel  feeding,  a  set  of  behaviors  that  involves  herding  the  fish 
from  depth,  corralling  them  into  a  ball  that  is  trapped  against  the  surface,  individually  tail 
slapping  the  edge  of  the  ball  and  consuming  the  fish  one  by  one  (Christensen,  1978;  Simila 
&  Ugarte,  1993;  Domenici  et  al.,  2000;  Nottestad  et  al.,  2002;  Simon  et  al.,  2005).  Previous 
research  has  restricted  its  exploration  of  carousel  feeding  to  the  group  level  because  of  the 
general  difficulties  associated  with  tracking  individual  marine  mammals. 

As  outlined  earlier,  one  goal  of  my  dissertation  was  to  probe  how  individual  killer 
whales  participated  in  carousel  feeding  groups.  The  central  aim  of  Chapter  2  therefore  was 
to  utilize  the  tag  data  from  individual  animals  to  reconstruct  as  complete  a  spatial  and 
temporal  portrait  as  possible  of  carousel  feeding  activity.  Using  the  acoustic  signature  of  the 
percussive  tail  slap,  the  dataset  was  divided  into  two  primary  behavioral  states:  tail  slapping 
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(TS)  and  not  tail  slapping  (NTS)  periods.  Tail  slaps  produced  by  the  tagged  animal  were 
identified  by  a  sudden  change  in  the  pitch  signal  as  the  animal  moved  from  poindng 
downwards  to  upwards.  The  diving  and  movement  details  preceding  and  following  a  focal 
tail  slap  appeared  to  be  controlled  primarily  by  the  posidon  and  shape  of  the  fish  ball. 

Group  tail  slapping  rates  were  higher  than  previously  characterized  and  individual  tail 
slapping  rates  during  carousel  feeding  varied  considerably,  suggesting  either  inter-individual 
or  inter-carousel  differences.  TS  episodes  were  characterized  by  a  higher  proportion  of  time 
spent  at  shallow  depths  (between  5  and  25m)  and  included  or  were  immediately  preceded  by 
the  deepest  dives  recorded  by  the  tags.  NTS  periods,  however,  contained  longer  inter-dive 
intervals  and  slower  excursions  to  depth.  The  variability  in  the  measurements  of  the  vertical 
velocity,  fluking  intensity,  change  in  pointing  angle  and  change  in  roll  were  elevated  during 
TS  compared  to  NTS  episodes.  These  results  were  sensible  since  carousel  feeding  required 
active  and  consistent  maneuvering  about  the  herring  at  shallow  depths  once  the  fish  had 
been  brought  to  the  surface. 

I  sought  to  differentiate  the  tight  circling  behavior  associated  with  carousel  feeding 
behavior  in  the  literature  from  other  periods  of  behavior.  A  linearity  index  (LI)  was 
computed  to  measure  the  relative  directedness  or  circuitousness  of  travel.  TS  episodes 
tended  to  be  characterized  by  low  LI  values  that  resulted  from  the  active  circling  of  the 
animals  to  maintain  proximity  to  or  perhaps  control  the  position  of  the  fish  ball.  In  contrast, 
NTS  episodes  were  defined  by  higher  LI  values  and  thus  more  directional  travel.  One  of 
two  categories  of  behavioral  sequence  preceded  the  TS  periods.  The  first  resembled  the 
pattern  described  previously  in  the  literature  in  which  broad  horizontal  looping  and 
occasional  deep  dives  by  the  animals  gave  way  to  the  tighter  looping  and  focal  and  non-focal 
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tail  slapping  activity  that  typified  carousel  feeding.  These  behaviors  implied  a  behavioral 
flow  from  putative  corralling  to  tail  slapping  and  eating.  In  the  second  type  of  sequence,  the 
killer  whales  transitioned  abrupdy  from  direct  straight  line  travel  to  carousel  feeding, 
suggesting  that  they  converged  upon  a  group  of  fish  that  had  already  coalesced  at  the  surface 
likely  as  the  result  of  the  corralling  efforts  of  another  group.  Together,  these  results  inform 
the  dynamic  and  opportunistic  nature  of  carousel  feeding,  a  habitat-specific  and  likely 
culturally  evolved  foraging  tradition. 

6.2.2  Chapter  3 

An  index  of  association  is  commonly  used  in  animal  behavior  to  describe  the 
strength  of  a  social  relationship.  Such  indexes  for  marine  mammals  have  traditionally  been 
limited  to  sightings  of  animals  that  surface  at  the  same  time  and/or  in  the  same  location. 

The  extent  to  which  individuals  overlap  in  space  and  time  varies  based  on  the  behavioral 
context  of  the  animals  and  the  definitions  and  protocols  established  by  different  research 
studies  (e.g.,  Wiirsig,  1978;  Ballance,  1990;  Bigg  et  al.,  1990;  Smolker  et  al.,  1992).  In 
addition,  surface  behavior  is  obviously  limited  to  a  set  of  intermittendy-sampled  2D 
observations  acquired  from  animals  that  maneuver  and  interact  in  3D. 

Chapter  3  explored  a  more  detailed  analysis  of  the  association  patterns  of  two  pairs 
of  simultaneously  tagged  killer  whales.  The  work  was  intended  to  challenge  the  frequent 
assumption  in  the  marine  mammal  literature  that  proximity  or  synchrony  at  the  surface 
automatically  translates  to  similar  behaviors  at  depth.  More  specifically,  the  depth  sensors 
on  the  tags  afforded  an  opportunity  to  examine  vertical  association  throughout  the  water 
column.  A  juvenile  and  adult  female  that  were  traveling  within  a  larger  group  were  tagged 
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simultaneously  and  they  traced  very  similar  vertical  profiles  as  they  surfaced  and  dove  in 
tight  synchrony.  These  animals  were  never  one  another’s  nearest  neighbor,  however, 
suggesting  that  the  entire  social  group  was  ascending  and  descending  in  unison  through  the 
water  column.  The  vertical  excursions  of  a  second  pair  of  simultaneously  tagged  animals,  an 
adult  female  and  male  killer  whale  within  a  carousel  feeding  group,  were  unlinked,  however. 
They  occasionally  overlapped  but  primarily  dove  out  of  phase  with  each  other. 

An  important  conclusion  drawn  from  this  chapter  was  that  the  presence  or  absence 
of  vertical  association  in  particular  (and  3D  association  more  generally)  may  have  been 
managed  more  by  the  behavioral  context  of  the  individuals  than  by  their  social  relationship. 
The  group  of  animals  in  which  the  first  killer  whale  pair  was  observed  was  silent  as  they 
swam,  implying  that  they  may  have  been  relying  on  visual  or  passive  acoustic  cues  to 
maintain  their  formation.  During  carousel  feeding,  by  contrast,  synchronous  occupation  of  a 
depth  layer  by  a  group  of  animals  could  lead  to  the  evacuation  and  escape  of  the  corralled 
herring  ball.  Coordination  of  this  foraging  sequence  requires  a  decoupling  of  the  vertical 
profiles  to  keep  the  herring  trapped,  allowing  individual  animals  to  break  rank  one  or  two  at 
a  time  to  tail  slap  the  ball  and  feed.  Ultimately,  when  designing  metrics  of  association,  it  is 
most  important  to  consider  the  movements,  relationships  and  behaviors  that  are  functionally 
and  socially  relevant  to  the  study  animals. 

6.2.3  Chapter  4 

When  investigating  the  function(s)  of  the  vocal  activity  of  killer  whales,  most 
research  has  matched  overall  changes  in  calling  rate  of  a  group  with  different  behavioral 
states.  Pulsed  calls  may  relate  to  arousal  level  (a  possible  explanation  for  the  elevated  calling 
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rates  of  transient  killer  whales  after  a  kill,  Deecke  et  al.,  2005),  function  as  an  acoustic  badge 
for  group  membership  (Ford,  1989),  provide  an  orientation  cue  to  conspecifics  (Miller,  2002) 
or  allow  animals  to  maintain  contact  with  one  another  through  matched  counter-calling 
(Miller  et  al.,  2004c).  In  only  one  study  was  a  single  call  type,  N21,  paired  predominantly 
with  a  particular  behavior  (Van  Opzeeland  et  al.,  2005).  In  general,  however,  the  null 
hypothesis  that  killer  whales  produce  and  use  their  call  types  interchangeably  has  remained 
unchallenged. 

This  claim  was  explicitly  tested  in  Chapter  4  by  searching  for  coincident  call  type 
usage  and  movement  behavior  patterns.  During  NTS  periods,  calling  bouts  were 
consistently  shallower  and  characterized  by  less  variable  vertical  velocity  than  bouts  lacking 
vocal  activity.  I  investigated  whether  the  vocal  behavior  associated  with  either  putative 
corralling  or  putative  travel  preceding  tail  slapping  (described  earlier  in  Chapter  2)  could  be 
differentiated.  Such  a  result  might  inform  the  function  of  and  justify  the  distinction  between 
these  two  sequences.  The  first  category  of  broad  looping  consistent  with  putative  corralling 
behavior  was  accompanied  by  high  rates  of  vocal  activity  in  all  4  instances,  which  may 
indicate  a  social  communication  function.  However,  two  of  the  three  instances  of  the 
second  category  of  directional  travel  contained  no  calls.  Visual  observations  were  available 
for  one  of  these  cases,  which  documented  the  tagged  (silent)  group  converging  on  another 
group  of  whales  that  was  already  engaged  in  feeding.  This  suggests  that  killer  whales  may 
eavesdrop  to  locate  neighboring  conspecifics  that  are  acoustically  active  (producing  pulsed 
calls  and  tail  slaps)  as  they  feed  on  herring  that  have  already  been  corralled  to  the  surface. 

We  do  not  know  the  exact  costs  and  benefits  of  attracting  other  whales  to  a  carousel  but  the 
foraging  animals  do  produce  loud  and  percussive  tail  slaps,  which  would  disclose  their 
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location  regardless. 

I  split  the  call  types  into  two  broad  categories  according  to  whether  they  were 
characterized  by  a  low  frequency  component  alone  (LFC)  or  low  and  high  frequency 
components  together  (L/HFC).  1  predicted  that  TS  periods  would  contain  higher  counts  of 
L/HFC  calls  since  these  call  types  were  more  likely  to  offer  orientation  cues  to  conspecifics 
(Miller,  2002)  than  LFC  calls.  Group  members  may  have  used  this  information  to  help 
maintain  the  coherence  of  the  herring  ball  for  feeding.  The  results  confirmed  this  prediction 
since  5  of  the  7  recordings  contained  more  L/HFC  (and  less  LFC)  calls  than  expected 
during  TS  periods  and  more  LFC  (and  less  L/HFC)  calls  than  expected  during  NTS  periods. 
Significant  differences  in  rate  of  call  type  production  were  observed  for  all  7  animals 
showing  both  TS  and  NTS  episodes.  For  six  of  the  whales,  the  three  call  types  contributing 
most  to  this  significant  difference  followed  the  trend  just  observed  in  which  LFC  call  types 
were  more  common  during  NTS  periods  and  L/HFC  call  types  during  TS  periods.  These 
observations  were  consistent  with  the  notion  that  certain  call  types  were  used  preferentially 
during  particular  behaviors,  providing  evidence  against  the  claim  that  call  types  were 
produced  interchangeably.  No  relationship  between  call  type  and  the  raw  movement 
measures  was  discovered,  suggesting  that  either  such  an  association  does  not  actually  exist  or 
that  the  movement  data  did  not  capture  the  relevant  behavioral  or  environmental  data  that 
triggered  particular  vocal  activity. 

6.2.4  Chapter  5 

An  elucidation  of  the  basic  units  of  vocal  production  yields  insight  into  how  the 
brain  archives  and  builds  acoustic  communication  signals  in  animals.  In  general,  such 
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vocalizations  have  been  considered  large  and  independent  units.  For  example,  most  studies 
have  focused  on  the  pulsed  call  as  the  fundamental  unit  of  killer  whale  vocal  production 
(Ford,  1987;  Strager,  1993;  Filatova  et  al.,  2004).  An  alternative  approach  might  search  for  a 
set  of  shared  vocal  subunits  that  comprise  these  stereotyped  and  variable  calls  (see  Yurk, 
2005  for  an  analysis  involving  syllable  divisions).  Indeed,  advances  in  digital  signal 
processing  and  a  conceptual  move  from  using  whole  words  to  phonemes  in  human  speech 
(Lee  et  al.,  1989)  have  afforded  improved  performance.  These  results  suggest  that  animal 
vocal  repertoires  might  benefit  from  similar  analytical  consideration.  Such  an  approach 
provides  an  important  step  towards  exploring  whether  animal  communication  signals  could 
be  generated  by  recombining  and  rearranging  a  small  and  finite  set  of  subunits.  In  Chapter 
5, 1  used  human  speech  processing  techniques  to  explore  whether  a  set  of  shorter  segments 
shared  across  Norwegian  killer  whale  call  types  operated  as  efficiently  as  whole  calls  to 
classify  vocalizations  to  call  type. 

A  pitch  tracking  algorithm  developed  for  human  telephone  speech  (Wang  &  Seneff, 
2000;  Wang,  2001)  was  successfully  applied  to  trace  the  fundamental  frequencies  of  the  killer 
whale  pulsed  calls.  Calls  were  segmented  based  on  brief  gaps  of  silence  or  abrupt  spectral 
shifts.  Three  experiments  involving  sets  of  6  temporal  and  spectral  summary  measurements 
were  designed  to  test  call  type  classification  efficiency.  The  whole  call  experiment  (WCE) 
operated  under  the  traditional  view  that  considered  the  entire  contour  as  the  entity  for 
sorting.  The  contours  were  then  divided  into  segments  that  were  either  completely  call- 
specific  (unshared  segment  experiment,  USE)  or  allowed  to  share  between  call  types  (shared 
segment  experiment,  SSE).  The  categorization  results  were  statistically  equivalent  across 
these  three  experiments.  The  SSE  approach  required  the  least  amount  of  information  (N  = 
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26  segments  versus  N  -  31  whole  call  types  and  N  =  62  distinct  syllables)  to  perform  at  the 
same  level  of  accuracy  as  the  other  experiments.  The  result  that  SSE  required  less 
information  to  classify  calls  just  as  successfully  suggests  that  it  is  a  more  parsimonious 
method  of  representing  the  vocal  behavior  of  these  animals  (see  Lee  et  al.,  1989).  Nearly 
75%  of  the  call  types  contained  at  least  one  of  the  shared  syllables,  suggesting  that  many  of 
the  vocalizations  were  drawn  from  a  common  inventory  of  sounds.  The  LFCs  were  often 
built  by  concatenating  different  strings  of  syllables  whereas  the  predominant  HFC  syllable 
sequence  was  shared  across  many  call  types. 

Variable  pulsed  calls  have  generally  been  viewed  as  a  miscellaneous  vocal  category 
that  lacks  a  relationship  with  the  stereotyped  calls.  In  addition,  stereotyped  calls  between 
populations  are  considered  unrelated.  These  assumptions  were  not  supported  by  my 
observation  that  Norwegian  variable  pulsed  calls  and  resident  and  transient  stereotyped  calls 
from  the  Pacific  Northwest  all  overlapped  to  differing  degrees  with  the  inventory  of 
segments  derived  from  the  Norwegian  stereotyped  calls.  The  Norwegian  variable  calls 
demonstrated  the  closest  overlap  with  the  stereotyped  calls,  implying  that  variable  calls  may 
constitute  unique  combinations  of  the  phonological  segments  derived  from  the  stereotyped 
utterances.  The  small  number  of  matches  between  the  stereotyped  Norwegian  calls  and  the 
Pacific  Northwest  calls  arose  either  because  killer  whales  from  different  populations  vocalize 
within  the  same  temporal  and  spectral  space  and  call  types  will  naturally  overlap  by  chance 
or  because  these  different  populations  actually  generate  their  vocalizations  from  a  common 
syllabic  inventory.  Given  the  earlier  discussion  of  cultural  transmission,  it  is  possible  that 
killer  whales  do  not  learn  the  phonemes  they  produce  but  rather  have  a  larger  set  of  possible 
syllables  at  birth  that  are  subsequently  pruned  with  experience  to  allow  the  generation  of 
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their  particular  repertoire,  a  phenomenon  called  selection-based  learning  (Nelson  &  Marler, 
1994).  However,  the  actual  sequences  and  patterns  of  syllables  may  still  be  socially  learned 
and  culturally  acquired. 

6.3  General  conclusions 

I  executed  two  new  approaches  to  studying  free-ranging  killer  whales  in  my 
dissertation.  First,  the  digital  tagging  technology  afforded  a  novel  opportunity  to  track  the 
movements  of  individual  killer  whales  as  they  engaged  in  group  feeding  and  vocal  behaviors. 
The  3D  orientation  and  position  information  yielded  by  the  movement  sensors  allowed  both 
gross  comparisons  of  measurement  means  and  variability  during  different  behavioral  states 
and  fine  examinations  of  pitch  changes  during  focal  tail  slaps.  These  data  demonstrated  a  set 
of  phenomena  across  the  whales  including  a  highly  circuitous  path,  more  variable 
movements  and  a  tail  slap-induced  abrupt  pitch  change  during  carousel  feeding.  Visual 
observations  at  the  surface  could  not  have  provided  information  on  individual  tail  slapping 
rates  or  continuously  tracked  an  animal  once  it  left  the  surface.  The  movement  analysis 
uncovered  an  important  dichotomy  in  the  behavioral  sequences  anticipating  carousel 
feeding.  One  scenario  upheld  the  earlier  description  of  looped  corralling  at  depth  giving  way 
to  the  carousel  while  the  other  suggested  a  more  direct  convergence  on  a  school  of  herring 
that  had  already  been  corralled. 

The  addition  of  group  vocal  behavior  data  reinforced  this  distinction  since 
heightened  vocal  behavior  accompanied  the  corralling  in  the  first  scenario  but  generally  not 
the  directed  travel  in  the  second.  Vocal  activity  certainly  accompanies  the  foraging  behavior 
of  numerous  killer  whale  populations  (e.g.,  Ford,  1989;  Deecke  et  al.,  2005;  Van  Opzeeland 
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et  al.,  2005)  and  may  function  to  help  coordinate  social  feeding  behavior.  During  corralling 
and  carouseling,  Norwegian  killer  whales  may  use  stereotyped  L/HFC  calling  to  convey 
additional  information  about  the  location  and  orientation  of  conspecifics.  A  carousel 
feeding  individual  could  keep  track  continuously  of  the  positions  of  the  other  group 
members  by  monitoring  their  incessant  vocal  activity.  In  contrast,  the  silence  of  the  directed 
travel  in  2  of  the  3  instances  could  be  explained  in  a  variety  of  ways  (e.g.,  the  animals  were 
not  interested  in  foraging,  they  were  eavesdropping  on  conspecifics,  they  were  foraging 
silently,  they  remained  silent  to  prevent  their  prey  from  detecting  them,  etc.).  Both  the 
abrupt  changes  in  direction  that  were  sometimes  observed  and  the  actual  convergence  on  at 
least  one  occasion  of  the  traveling  group  with  another  carousel  feeding  group  reinforce  the 
eavesdropping  hypothesis.  During  these  periods,  the  silent  group  was  likely  coordinating 
their  movements  and  behaviors  visually. 

Second,  1  conducted  a  set  of  experiments  that  profited  from  human  speech 
processing  algorithms  and  approaches.  The  field  of  speech  recognition  productively 
implements  phonemic  instead  of  whole  word  representations  of  human  language  (see  Lee  et 
al.,  1989),  and  a  similar  approach  was  employed  here.  The  results  were  consistent  with  the 
notion  that  killer  whales  may  assemble  their  stereotyped  and  variable  calls  from  a  shared  set 
of  segments.  If  this  finding  can  be  confirmed,  it  would  help  guide  our  understanding  of  how 
these  animals  produce,  process  and  store  in  memory  their  pulsed  calls.  For  example,  instead 
of  archiving  entire  call  types,  these  animals  may  store  individual  syllables  and  their  various 
orderings  to  generate  call  repertoires.  They  might  produce  variable  calls  simply  by 
rearranging  these  syllables  into  non-typical  sequences. 


183 


6.4  Future  directions 


There  are  several  directions  for  future  work.  One  important  advance  involves 
exploring  whether  cooperation  and  role  playing  or  turn  taking  is  occurring  during  carousel 
feeding  or  other  behaviors.  More  than  two  simultaneous  tagouts  will  provide  important  data 
about  how  these  animals  position  and  orient  themselves  relative  to  one  another  as  they 
carousel  feed  and  whether  they  role  play  and/ or  take  turns  as  they  corral  and  tail  slap  the 
fish.  Role  specialization  is  one  way  in  which  animals  trying  to  solve  a  problem  can 
coordinate  their  efforts.  Some  animals  display  highly  specialized  roles  (e.g.,  lions  ( Panthera 
leo ),  Packer  et  al.,  1990)  while  others  engage  in  turn-taking  behaviors  in  which  animals  cycle 
through  behaviors  to  complement  one  another  (e.g.,  Dusky  dolphins  ( [Lagenorhynchus  obscurus ), 
Wiirsig  &  Wiirsig,  1980;  Wiirsig,  1986).  Norwegian  killer  whales  might  take  turns  during 
carousel  feeding,  for  example,  if  different  animals  contained  herring  to  prevent  losing  the 
fish  while  others  tail  slapped  to  consume  the  fish.  It  is  therefore  important  to  characterize 
the  component  behaviors  produced  by  killer  whales  engaged  in  particular  activity  sequences 
and  to  identify  when  different  individuals  display  each  of  the  behaviors  using  multiple 
simultaneous  tag  records. 

In  addition,  experimental  tests  of  observational  learning  are  important  for 
establishing  the  possibility  of  social  transmission  of  behavior  among  killer  whales.  Captive 
studies  might  involve  seeding  a  different  problem-solving  approach  with  two  different 
animals  and  observing  whether  and  how  these  behaviors  propagate  through  the  social  group. 
This  kind  of  study  would  ground  the  work  in  the  wild  in  the  context  of  an  experimental 
demonstration  of  the  presence  or  absence  of  observational  learning  and  cultural 
transmission. 
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It  is  important  to  understand  the  predator-prey  dynamics  and  interactions  of  killer 
whales  and  herring.  Further  work  should  focus  on  how  killer  whales  and  herring  respond  to 
the  movements  of  one  another  spatially  and  temporally.  This  kind  of  integrated  study  could 
explore  whether  a  particular  conformation  of  herring  is  required  before  tail  slapping  is 
initiated.  This  could  be  achieved  by  combining  further  tagging  work  with  a  simultaneous 
monitoring  of  the  herring  position  and  geometry  via  active  sonar  or  video.  A  comparative 
study  involving  other  populations  and  species  of  marine  mammals  that  feed  on  herring 
would  provide  a  useful  perspective  on  convergent  or  distinct  behavioral  strategies  for 
feeding  on  a  mobile  coastal  fish  species.  It  could  also  explore  whether  any  observed 
differences  are  genetic,  ecological  or  culturally  learned. 

Icelandic  orcas  engage  in  carousel  feeding  as  well,  for  example,  but  produce  a  low 
frequency  136  call  that  is  thought  to  corral  the  herring  (Simon  et  al.,  2005;  2006;  2007).  To 
link  the  136  definitively  with  a  corralling  function,  simultaneous  acoustic  recording  and 
herring  monitoring  will  be  required  in  a  preliminary  study  before  conducting  a  playback 
experiment.  Beyond  killer  whales,  humpback  whales  in  southeastern  Alaska  bubble  net  feed 
on  herring  in  groups  and  a  particular  call  reliably  and  immediately  precedes  their  lunging 
ascent  (D’Vincent  et  al.,  1985).  Simultaneous  tagging  and  array  recordings  could  help 
determine  whether  the  call  is  produced  consistently  by  the  same  animal  or  by  the  animal  that 
occupies  a  certain  position  within  the  group’s  3D  spatial  conformation.  Using  this 
integration  of  data  recording  technologies  to  study  the  feeding  behavior  of  free-ranging 
marine  mammals  will  offer  important  comparative  insights  into  how  marine  animals  forage 
in  groups  and  whether  they  cooperate  to  do  so. 

Another  productive  direction  for  future  research  for  this  work  will  require  assigning 
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call  production  to  individual  animals.  This  will  inform  the  nature  of  vocal  interactions 
between  individuals  and  help  determine  whether  certain  call  types  act  as  triggers  for 
behavioral  responses  in  particular  group  members.  The  use  of  a  hydrophone  array  (Miller  & 
Tyack,  1998)  in  the  Pacific  Northwest  was  a  start  in  this  research  direction  as  it  yielded  data 
to  support  the  hypothesis  that  killer  whales  engage  in  matched  counter-calling  by  responding 
to  stereotyped  calls  with  calls  of  the  same  type  (Miller  et  al.,  2004c).  The  results  that  I 
present  in  my  dissertation  are  based  on  the  movements  of  individual  killer  whales  and  the 
vocalizations  of  their  groups.  The  use  of  an  array  accompanied  by  more  digital  tags  that  are 
simultaneously  deployed  or  tags  with  more  specialized  sound  localization  possibilities  would 
bring  us  a  little  closer  towards  identifying  the  vocalizing  individuals.  The  major  difficulty  will 
be  distinguishing  individual  animals  that  vocalize  in  very  close  proximity  to  one  another. 

Until  technical  advances  are  able  to  compensate,  sound  localization  may  be  restricted  to 
behaviors  in  which  vocalizing  groups  or  individuals  are  sufficiently  separated  in  space  from 
each  other.  Any  progress  in  this  domain  will  offer  insights  into  the  social  and  behavioral 
function  of  killer  whale  vocal  activity. 

The  syntax  chapter  motivates  further  inquiry  into  how  animals  acquire  their 
stereotyped  vocal  repertoire.  Human  infants  go  through  a  babbling  stage  where  they 
produce  numerous  phonemes  but  only  some  of  these  are  subsequently  retained  for  speech 
once  they  become  verbal.  To  characterize  the  vocal  development  of  killer  whales,  it  would 
be  useful  to  explore  whether  they  acquire  their  repertoires  by  proceeding  through  a  similar 
babbling  stage.  If  syllables  are  produced  in  isolation  or  in  unusual  orders  early  in  life,  this 
might  suggest  that  phonemes  are  largely  inherited  while  the  group-specific  sequences  are 
learned.  If,  however,  the  stable  syllables  do  not  emerge  until  later  in  vocal  development, 
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these  might  be  vocally  learned  through  imitation.  This  work  would  be  conducted  most 
productively  in  captivity  where  the  vocal  ontogeny  and  acoustic  environment  of  a  newborn 
calf  could  be  tracked  closely.  If  killer  whales  do  babble,  I  would  expect  that  early  in 
development  their  vocal  behavior  would  be  characterized  by  more  variable  calls.  As  they 
learn  their  repertoire  over  time,  they  would  produce  increasing  amounts  of  stereotyped 
calling  activity  until  reaching  a  certain  threshold,  which  may  vary7  depending  on  behavioral  or 
social  context.  The  captive  setting  would  also  afford  an  opportunity  to  explore  whether 
killer  whales  can  learn  new  syllables  and/or  new  orders  of  already  established  syllables.  Such 
a  demonstration  would  provide  strong  evidence  for  vocal  learning  and  phonological  syntax 
in  these  animals. 
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Appendix  1.  Additional  Movement  Plots 


Al.l  Pitch,  depth  and  Az  variation  profiles  centered  on  focal  tail  slaps 

Pitch  (left),  depth  (center)  and  Az  variation  (right)  profiles  centered  on  focal  tail  slaps  (gray  dotted  line).  Each 
line  color  corresponds  to  a  unique  focal  tail  slap  and  is  consistent  across  each  horizontal  triptych  of  plots. 
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The  pseudo-track  is  a  non-geo-referenced  inertial  path.  1  lorizontal  movement  is  plotted  as  relative  distance  and  the  beginning  and  ending  of  this  track  are 
shown  (□  and  o,  respectively).  In  the  left  panel,  depth  is  colorized,  the  filled  gray  circles  represent  rail  slaps  produced  by  a  non- ragged  individual,  and  the  red 
circles  indicate  tail  slaps  produced  by  the  tagged,  focal  animal.  In  the  right  panel,  the  red  sections  of  the  track  correspond  to  periods  of  low  linearity' 
(thresholded  using  the  focal  tail  slap  data,  see  Chapter  2). 
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A1.3  Movement  data  summary  plots 

The  linearity  index  (I. I)  is  plotted  as  a  thin  continuous  black  line  in  the  two  uppermost  panels  and  ranges  from 
0  (no  displacement)  to  1  (straight  path).  The  LI  plots  two  rough  states:  a  low  linearity  (high  circuitous)  state 
when  the  animals  were  tail  slapping  and  a  high  linearity  (low  circuitous)  state  when  few  tail  slaps  were  observed 
(see  Figure  2.9).  Dive  duration  and  maximum  dive  depth  are  indicated  with  the  magenta  squares  and  red 
circles  in  the  left  and  right  panels,  respectively.  The  small  gray  squares  and  open  black  circles  on  the  LI  curve 
indicate  non-focal  and  focal  tail  slaps,  respectively.  The  two  tail  slapping  (TS)  penods  are  indicated  by  the 
black  horizontal  bars  at  the  top  of  the  plot.  The  low  LI  episodes  are  shown  by  the  blue  horizontal  bars. 
Beginning  in  the  second  row  and  reading  left  to  right,  the  remaining  panels  plot  the  change  in  pitch  (degrees), 
depth  (m),  change  in  roll  (degrees),  vertical  velocity  (m/s),  residual  heading  (degrees)  and  variation  in  the  z-axis 
of  the  accelerometer  (a  proxy  for  fluking  energy,  relative  units).  Time  is  reported  locally  and  runs  ldenticallv 
along  the  x-axis  of  each  panel. 
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Appendix  2.  Norwegian  Call  Types 


Spectrograms  of  Norwegian  call  types  recorded  during  the  tagouts  listed  in  square  brackets.  Call  types  from 
N1  to  N34  were  desenbed  by  Strager  (1993),  call  types  from  N35  to  N63  were  defined  bv  Van  Opzeeland  et  al. 
(2005)  and  call  types  N64  to  NT 03  were  newly  added  here. 
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Appendix  3.  Norwegian  Stereotyped  Call  Contours 
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Appendix  4 


Preliminary  evidence  for  signature  vocalizations  among 
free-ranging  narwhals  ( Monodon  monoceros)a) 

Ari  D.  Shapirob' 

Hittloffv  Dtfkiritnent.  Wtxkls  Holt1  Oeeanoyruphie  Instil  niton.  MS  050.  Wottds  Httie.  Muwat  husciis  0254* 
(Received  14  March  2(KX>;  revised  9  June  2(XX>:  accepted  21  June  2006) 

Animal  signature  vocalizations  that  arc  distinctive  at  the  individual  or  group  level  can  facilitate 
recognition  between  conspecilics  and  re-establish  contact  with  an  animal  that  has  become  separated 
from  its  associates.  In  this  study,  the  vocal  behavior  of  two  free-ranging  adult  male  narwhals 
\Mtmodon  numocems)  in  Admiralty  Inlet.  Baffin  Island  was  recorded  using  digital  archival  tags. 
These  recording  instruments  were  deployed  when  the  animals  were  caught  and  held  onshore  to 
attach  satellite  tags,  a  protocol  that  separated  them  from  their  groups.  The  signature  content  of  two 
vocal  categories  was  considered:  ( I )  combined  tonal/pulsed  signals,  which  contained  synchronous 
pulsatile  and  tonal  content:  (2)  whistles,  or  frequency  modulated  tonal  signals  with  harmonic  energy 
Nonparamctric  comparisons  of  the  temporal  and  spectral  features  of  each  vocal  class  revealed 
significant  differences  between  the  two  individuals.  A  separate,  cross-correlation  measure  conducted 
on  the  whistles  that  accounted  for  overall  contour  shape  and  absolute  frequency  content  confirmed 
greater  interindividual  compared  to  intraindividual  differences.  These  data  arc  consistent  with  the 
hypothesis  that  narwhals  produce  signature  vocalizations  that  may  facilitate  their  reunion  with  group 
members  once  they  become  separated,  but  additional  data  are  required  to  demonstrate  this  claim 
more  rigorously.  ©  200b  Acoustical  Society  of  America.  [DOI:  10.1121/1.2226586] 

FACS  Humberts):  43.80.Ka,  43.80.Ev  [WAA]  Pages:  1695-1705 


I.  INTRODUCTION 

Signatua*  vocalizations  of  animals  acoustically  encode 
individual  or  group  identity  and  are  characterized  by  unique 
sets  of  spectral  and/or  temporal  attributes.  The  specific 
acoustic  features  required  to  distinguish  between  individuals 
according  to  their  vocalizations  have  been  measured  in  a  host 
of  laxa  [e  g.,  birds:  macaroni  penguins  ( Eudyptcs  cliryxolo - 
ffhu.s).  Searby  cl  ai.  2004.  chiroptcrans:  evening  bats  (Mr 
ticcius  humeral  is).  Schcrrer  and  Wilkinson.  1 993.  canids: 
timber  wolves  (Catiis  Input).  Goldman  el  ul..  1995.  primates: 
common  marmosets  {Callithrix  jacchus).  Jones  ct  ai.  1993, 
pinnipeds:  subantarctic  fur  seals  [Antocephalus  tmpicalis). 
Charrier  cl  al..  2001.  2003.  cetaceans:  bottlenose  dolphins 
{Tursiopx  tr  uncat  us).  Janik.  1999.  Wat  wood  el  ai.  2005]. 
Playback  experiments  have  demonstrated  that  animals  can 
recognize  signature  signals  and  have  illustrated  the  diversity 
of  contexts  in  w  hich  signature  vocalizations  are  used,  includ¬ 
ing  facilitating  recognition  between  an  infant  and  one  or  both 
of  its  parents  [cliff  swallows  ( Hirundo  pyrrlumoia):  Stoddard 
and  Beecher,  1983,  tree  swallows  (Tachycinctu  bicolor):  Le¬ 
onard  cl  ai.  1997.  Mexican  free-tailed  bats  ( Tadarida  brasil - 
iensis  mcxicana):  Balcombe,  1990.  fur  seals  ( Antocephalus 
tmpicalis):  Charrier  cl  ai.  2(X)I.  2003.  bottlenose  dolphins: 
Sayigh  ci  al..  1998.  Janik  el  al..  2006].  male-pair  recognition 
[king  penguins  (Aptenodyics  patayonicus):  Lengagne  ci  ai. 
2000].  and  group  affiliation  associated  with  territorial  de 
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lease  (North  American  bullfrog  (thina  ealcslu  iana).  Bee  and 
Gerhardt,  2(X)2.  Arctic  foxes  (Alopcx  lay  opus):  Frommolt  ci 
ai.  2003).  There  are  selective  benefits  for  the  signals  pro¬ 
duced  in  these  contexts.  Recognition  is  very  important  when 
one  or  both  parents  must  allocate  a  finite  amount  of  re¬ 
sources  to  their  offspring,  a  scenario  in  which  confusion  is 
associated  with  high  fitness  costs.  Mates  or  groups  of  indi¬ 
viduals  that  consistently  defend  one  another,  their  young,  or 
their  territory  can  benefit  from  individual  or  group  recogni¬ 
tion  because  it  provides  a  system  for  remembering  with 
whom  they  have  shared  mutual  investments 

The  proximate  methods  for  achieving  signature  recogni¬ 
tion  can  include  imprinting,  habituation,  associative  learning, 
and  vocal  learning.  Vocal  learning  occurs  when  the  respira¬ 
tory,  phonutory,  and/or  tiller  systems  urc  employed  to  render 
signals  more  or  less  similar  to  acoustic  models  that  are  en¬ 
countered  through  experience  with  other  individuals  (Janik 
and  Slater.  1997.  2(XX)I.  Contact  calls,  generally  used  by  am 
mals  when  they  become  separated  from  their  social  part- 
ncr(s)  or  group  to  first  locate  one  another  and  then  mediate 
reunion,  contain  signature  content  in  certain  species  and  ap¬ 
pear  to  be  vocally  learned  When  placed  in  social  groupings 
of  unfamiliar  individuals,  the  contact  calls  of  male  budgeri¬ 
gars  (Melopsiliacus  undulatus).  for  example,  initial l>  eon- 
verged  and  subsequently  underwent  continuous  and  synchro¬ 
nous  changes  (Farabaugh  ct  ai.  1994).  Evidence  is 
accumulating  for  some  species  of  nonhuman  primates  to  pos¬ 
sess  vocal  plasticity  during  adulthood  despite  its  apparent 
absence  during  development  (see  review  by  Egnor  and 
Hauser,  2(X)4).  Male  chimpanzees  [Pan  tmyhulMes)  produce 
pant  hoots,  long-distance  vocalizations  that  seem  to  function 
m  maintaining  contact  with  and  attracting  allied  individuals 
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(Milam  and  Nishida.  1993).  Pani  hoot  convergence  was  ob¬ 
served  among  both  chorusing  dyadic  pairs  (Milani  and  Gros- 
Louis.  1998)  and  larger  groups  containing  3-11  adult  males 
(Marshall  et  at..  1999).  Similarly,  several  spectral  and  tem¬ 
poral  parameters  of  the  contact  calls  among  pygmy  marmo¬ 
sets  ( Cebuctla  pygmaea).  referred  to  as  trills,  underwent  par¬ 
allel  or  convergent  shifts  between  new  adult  mate  pairs 
(Snowdon  et  at..  1997;  Snowdon  and  Elowson.  1999).  Com¬ 
parable  observations  were  made  when  two  naive  pygmy  mar¬ 
moset  social  groups  of  mixed-age  composition  were  intro¬ 
duced  (Elowson  and  Snowdon,  1994). 

In  principle,  it  is  possible  to  conclude  that  signals  con¬ 
tain  signature  content  by  demonstrating  more  interindividual 
than  intraindividual  variability.  To  offer  sufficient  proof,  this 
result  must  be  shown  for  at  least  several  exemplars  of  the 
signature  signals  of  each  of  several  individuals.  One  of  the 
most  striking  examples  of  signature  vocalizations  is  found 
among  bottlenose  dolphins  whose  signature  whistles,  lirsl 
identified  by  Caldwell  and  Caldwell  (1965).  appear  to  func¬ 
tion  as  vocally- learned  contact  calls.  Among  bottlenose  dol¬ 
phins.  whistle  imitation  appears  to  be  an  important  agent  in 
the  ontogeny  of  this  signature  vocalization  (Sayigh  el  at.. 
1990,  Miksis  el  at.,  2002)  and  in  social  communication  both 
in  captivity  (Tyack.  1986)  and  in  the  wild  (Janik.  2000.  Fnpp 
et  at.,  2005).  Bottlenose  dolphin  mother-calf  pairs  were  more 
likely  to  orient  towards  one  another’s  signature  whistles 
when  separated  from  each  other  and  temporarily  restrained 
than  those  of  other  individuals  of  the  same  corresponding 
age  cohort  (Sayigh  et  at..  1998.  Janik  et  at..  2006).  Captive 
animals  produced  their  signature  whistles  most  often  when 
they  separated  themselves  voluntarily  and  spontaneously 
from  their  mixed-age  group  by  swimming  into  an  adjacent 
tank  compared  to  when  they  were  swimming  together  (Janik 
and  Slater.  1998).  The  remaining  dolphins  were  also  more 
likely  to  produce  their  signature  whistles  when  an  individual 
left  the  main  tank.  Finally,  adult  males  that  had  strong  social 
bonds  with  another  male  were  most  likely  to  use  signature 
whistles  when  they  were  separated  either  due  to  temporary, 
artificial  restraint  or  voluntarily  w  hen  they  were  free-ranging, 
presumably  to  facilitate  an  eventual  reunion  (Watwood  et  at., 
2005).  Collectively,  these  studies  reveal  the  importance  of 
signature  whistles  in  maintaining  contact  between  bottlenose 
dolphin  individuals  in  artificial  and  natural  settings  alike  and 
in  both  involuntary  and  voluntary  contexts. 

Despite  some  debate  (see  McCowan  and  Reiss,  1995. 
2001:  Janik.  1999  for  discussion:  Smolker  and  Pepper. 
1999).  the  studies  just  described  have  helped  to  solidify  the 
case  for  signature  whistles  in  bottlenose  dolphins  and  sug¬ 
gest  the  possibility  of  signature  vocalizations  among  other 
long-lived,  social  odontocetes  in  an  underwater  environment 
where  acoustic  signaling  is  the  most  reliable  and  efficient 
form  of  communication.  In  addition,  signature  information 
can  also  be  encoded  at  the  group  level,  which  can  form  the 
basis  for  acoustic  badges  of  membership  (e.g..  Boughman. 
1997).  For  example,  five  of  six  shared  call  types  among 
killer  w  hales  (On  inns  on  a)  revealed  distinctive  structural 
differences  between  matrilineal  units  (Miller  and  Bain.  2(XX)) 
and  it  is  possible  that  other  signature  details  allow  differen¬ 
tiation  between  individual  animals  as  well  (Nousck  et  at.,  in 
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press).  Cultural  divergence  of  discrete  call  types  appears  to 
account  for  some  of  the  subtle  differences  in  the  temporal 
and  spectral  features  across  these  matrilineal  units  and  even 
within  pods  (Deecke  et  at..  2000). 

The  principal  challenge  for  studying  the  signature  sig¬ 
nals  of  marine  mammals  involves  the  difficulty  of  assigning 
vocalizations  in  the  wild  unambiguously  to  the  individual 
animal  that  produced  them.  In  this  manuscript,  the  possibility 
of  signature  signals  among  free-ranging  narwhals  (M (motion 
monocents)  was  examined  by  recording  the  acoustic  activity 
of  two  individuals  with  digital  archival  tags.  These  gregari¬ 
ous.  long-lived  Arctic  odontocetes  migrate  distances  of  thou¬ 
sands  of  kilometers  in  large  numbers  with  subpopulations 
moving  in  a  coordinated  fashion  (Hay  and  Mansfield,  1989; 
Dietz  and  Heide-Jorgensen,  1995:  Laidre  et  at..  2(X)4).  They 
travel  in  groups  that  are  often  sex  segregated  and  range  in 
size  from  a  few  animals  to  dozens  of  individuals,  although 
the  stability  or  fluidity  and  interconnectedness  of  these  as¬ 
semblages  remain  unknown  (reviewed  in  Hay  and  Mansfield. 
1989). 

Narwhals  produce  ccholocation  clicks  with  repetition 
rates  between  2  and  >5(X)/s  (Ford  and  Fisher.  1978.  Mohl  et 
at..  1990).  maximum  frequencies  reaching  at  least  160  kHz 
(Miller  et  at..  1995)  and  maximal  source  levels  reaching 
218  dB  re  I  pPa  (Mohl  et  at.,  1990).  Miller  et  al.i  1995) 
arbitrarily  divided  clicking  into  the  two  categories  of  train 
clicks  produced  at  ^30  clicks/s  and  burst  clicks  produced  at 
2*40  clicks/s.  Pulsatile  sounds  featuring  a  repetition  rate 
high  enough  to  possess  a  tonal  character  with  harmonically 
related  sidebands  (see  Watkins,  1967)  were  called  longer 
click  series  by  Watkins  et  at.  (1971)  and  pulsed  tones  by 
Ford  and  Fisher  ( 1978).  Characterized  as  narrow -band,  these 
signals  had  durations  between  0.56  and  several  seconds  and 
spectral  energy  ranging  from  500  Hz  to  24  kHz.  The  repeti¬ 
tion  rate  was  generally  constant  although  Watkins  et  at. 
(1971)  reported  u  tendency  for  the  repetition  rate  of  these 
vocalizations  to  increase  at  the  very  beginning  and  slow 
down  towards  the  end.  In  this  manuscript,  these  signals  will 
be  referred  to  as  combined  tonal/pulsed  signals.  Finally, 
narrow -band,  frequency  modulated  (FM)  whistles  have  been 
described  that  generally  last  <  1 .0  s  (range:  0.1 -6.0  s)  and 
have  a  frequency  range  between  300  Hz  and  18  kHz  (Ford 
and  Fisher.  1978:  Mohl  et  at..  1990). 

Although  Ford  and  Fisher  (1978)  did  not  lind  any  evi¬ 
dence  for  signature  content  among  whistles,  they  speculated 
that  the  different  pulsed  tones  in  their  recordings  were  pro¬ 
duced  by  separate  individuals  as  signature  calls  in  a  social 
context.  They  recorded  series  of  the  same  tone  growing 
louder  and  then  softer,  concluding  that  this  resulted  from  one 
individual  producing  each  series  as  it  approached  and  then 
swam  past  a  stationary  hydrophone.  This  possibility  was  not 
conclusive  since  groups  of  animals  were  sw  imming  by  the 
recorder  and  multiple  individuals  could  have  been  producing 
each  tone.  In  addition,  no  data  on  differences  in  acoustic 
parameters  were  available  to  quantify  the  distinctiveness  of 
the  calls.  In  this  study,  we  examined  the  possibility  of  signa¬ 
ture  vocalizations  among  free-ranging  narwhals  more 
closely.  The  results  support  this  hypothesis  for  both  com¬ 
bined  tonal/pulsed  signals  and  whistles,  suggesting  a  social 
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FIG.  I  Narwhal  xhown  with  (riltercam  (contained  within  the  dashed  el¬ 
lipse)  ami  DTAG  (contained  within  the  dashed  rectangle)  attached  immedi¬ 
ately  before  release.  f*hotograph  courtesy  of  Rune  Diet/ 

function  for  vocal  production  thai  is  distinctive  either  at  the 
individual  or  group  level.  Further  work  is  recommended  to 
confirm  signature  vocal  production  among  additional  ani¬ 
mals  and  to  ascertain  the  natural  function  of  these  vocaliza¬ 
tions  in  the  wild. 

II.  METHODS 

A.  Study  area 

Field  work  was  conducted  from  8-23  August  2(X>4  at 
Kakiak  Point.  Admiralty  Inlet  on  Baffin  Island  in  Nunavut. 
Canada  (73l40'N.  86°40*W).  The  inlet  has  a  maximum 
depth  of  720  m.  Groups  of  narwhals  ranging  from  approxi¬ 
mately  5-30  individuals  (pers.  obs.)  traveled  into  the  inlet  at 
this  time  of  year  once  the  ice  had  mostly  melted.  The  field 
camp  occupied  a  position  about  500  m  from  a  site  used  in¬ 
termittently  by  the  lnuit  to  hunt  narwhals. 

B.  Equipment 

This  experiment  employed  a  digital  archival  tag  (DTAG) 
developed  by  Johnson  and  Tyack  (2003)  featuring  a  single 
hydrophone,  pressure  and  temperature  sensors,  and  a  triaxial 
accelerometer  and  magnetometer,  which  recorded  to  fiash 
memory'.  The  sampling  rate  of  the  hydrophone  was  set  to 
%  kHz  while  the  other  sensors  sampled  at  50  Hz.  A  16  bit 
ADC  was  used.  Sigma  della  conversion  provided  an  effec¬ 
tive  antialiasing  filter,  dispensing  w  ith  aliasing  caused  by  en¬ 
ergy  exceeding  the  Nyquist  frequency  of  48  kHz.  The  tag 
attached  noninvasively  to  individual  animals  via  suction  cups 
and  its  release  was  coupled  to  the  release  mechanism  of  the 
National  Geographic  Crittcrcam  (see  Marshall.  1998)  (hat 
was  deployed  simultaneously.  A  VHF  transmitter  signaled 
the  location  of  an  attached  tag  intermittently  as  the  animal 
surfaced  and  then  regularly  once  the  lag  was  released  and 
flouted  to  the  water's  surface. 

C.  Capturing  and  tagging  protocol 

The  DTAG  was  deployed  in  collaboration  with  a  satel¬ 
lite  tagging  project  that  required  working  with  the  animals 
onshore.  As  described  by  Dietz,  et  ai  (2(X)1).  a  50  m  long 
and  10  m  deep  black  net  with  20  X  20  cm  mesh  was  oriented 
perpendicular  to  the  shore  and  kept  afloat  with  7-8  white 
buoys.  The  nef  was  secured  to  the  shore  and  in  the  water. 


FIG.  2.  Sample  spectrograms  (largo.  top  plots)  and  wax  c  lor  ms  (smullct. 
bottom  plots)  ol  a  combined  lomd/pulurtl  signal  produced  by  individual  (a) 
mm224  and  (b)  inm226  with  a  FIT  si/C  ami  Iraine  length  of  512  points. 
50*4  window  overlap,  and  a  maximum  frequency  displayed  ol  4S  kH/  The 
low  frequency  energy  associated  with  rm»M  of  the  pulses  is  likely  due  to  the 
resonance  of  the  air  sms  involved  in  sound  production  or  transmission.  The 
solid  arrows  in  both  spectrograms  indicate  the  synchronous  FM  tonal  com 
poneni  produced  by  the  tagged  animal 


When  weather  conditions  permitted,  the  net  was  deployed 
and  monitored  constantly  for  caught  animals,  signaled  by  the 
submergence  of  at  least  one  of  the  buoys.  As  stx>n  as  a  w  hale 
became  entangled,  two  bouts  were  dispatched  immediately  to 
bring  the  animal  to  the  surface  to  broathe  and  tow  it  to  shore 
with  the  assistance  of  a  land-based  crew  hauling  on  the  net 
Once  an  animal  was  caught,  the  remainder  of  its  group 
moved  out  of  visual  range,  presumably  continuing  their  mi¬ 
gration  deeper  into  the  inlet.  A  fluke  belt  was  used  to  keep 
the  animal  ashore  and  oriented  w  ith  its  head  submerged  and 
pointed  into  the  water  while  its  blowhole  hud  access  to  the 
air  at  all  times.  Three  males  and  five  females  were  captured 
in  all.  During  satellite  tag  attachment,  blood  samples  were 
collected  to  assess  overall  health  and  stress  levels. 

Just  before  two  of  the  adult  males  and  one  of  the  adult 
females  were  guided  back  to  deeper  water,  a  DTAG  was 
attached  to  their  dorsal  sides  ~0.5  m  caudal  to  the  blow  hole 
(Fig.  I ).  These  animals  were  not  followed  visually  once  they 
were  released  from  shore  so  it  w  as  not  possible  to  determine 
whether  they  eventually  reunited  with  their  group  members. 
The  VHF  signal  was  monitored  from  the  field  camp  on  shore 
using  two  handheld  vagi  antennae.  Once  a  regular  VHF  sig- 
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FIG.  3.  Spectrogram  composite  of  all 
four  whistles  of  mm2 24  (a~d)  ami  14 
of  the  17  whistles  of  mm22h  (f-s> 
with  a  FFT  st/c  and  frame  length  of 
512  points.  50**  window  overlap,  and 
a  maximum  frequency  displayed  of 
20  kH/.  The  remaining  3  whistles  of 
mm226  resembled  il»osc  displuycsl 
here  but  were  excluded  for  graphical 
convenience.  The  waveforms  dis¬ 
played  in  subplots  e  and  »  are  of  the 
same  whistles  used  to  generate  sub¬ 
plots  d  and  s.  respectively. 


nal  was  detected  from  a  tag  that  had  released  from  the  animal 
and  the  weather  permitted,  a  boat  was  dispatched  for  recov¬ 
ery.  If  the  VHF  signal  grew  too  faint  to  delect  from  shore, 
tracking  was  conducted  from  a  higher  altitude  on  the  nearby 
cliffs  for  improved  range.  The  first  tag  recorded  for  2.54  h 
(male  mm224),  the  second  tag  for  12.14  h  (male  mm226). 
and  the  third  tag  was  not  recovered.  These  two  lagged  males 
entered  the  inlet  two  days  apart,  strongly  suggesting  that  they 
belonged  to  different  social  groups.  The  data  were  offloaded 
and  burned  to  CD  in  duplicate  in  the  field. 

D.  Vocalization  extraction 

The  14.68  h  of  recordings  were  audited  by  listening  to 
and  visually  examining  the  spectrograms  in  15  s  segments. 
Focal  (tagged  animal)  vocalizations  were  marked  according 
to  their  starting  time  and  vocal  category.  It  was  assumed  that 
vocalizations  with  a  relatively  high  signal  to  noise  ratio 
(SNR)  belonged  to  the  focal  animal  and  not  a  neighboring 
non  focal  animal.  Although  this  assumption  could  not  be  veri¬ 
fied  visually  because  the  lagged  narwhals  were  not  followed, 
it  was  true  for  at  least  the  first  few  dives  since  no  group 
members  were  observed  in  the  immediate  vicinity.  Much 
softer  sounds  were  often  heard  on  the  recordings,  presum¬ 
ably  from  more  distant,  vocalizing  nonfocal  animals.  The 
SNR  of  these  focal  vocalizations  was  computed  by  compar¬ 
ing  the  rxx)t  of  ihe  mean  of  the  squared  pressure  (RMS)  along 
the  window  containing  90*2  of  the  signal  energy  to  a  seg¬ 
ment  of  noise  of  the  same  duration  immediately  preceding 
the  signal  (Madsen.  2005).  The  analysis  presented  here  only 
excluded  echolocation  clicks,  or  broadband  pulses  of  energy 
with  ituerclick  intervals  usually  greater  than  100  ms.  Nearly 
all  remaining  vocalizations  were  considered  that  could  be 
divided  into  the  two  discrete  categories  of  (1)  combined 
tonul/pulsed  signals  (Fig.  2),  defined  as  uninterrupted  pulsa¬ 
tile  vocalizations  with  a  synchronously  produced  FM  fonal 

J.  Acoust.  Soc.  Am..  Vol.  120.  No.  3.  September  2006 


component,  a  low  mean  interpulse  interval  (IPI  <  13  ms), 
and  high  pulse  number  049).  and  (2)  whistles  (Fig.  3). 
which  were  characterized  by  FM.  tonal  energy  with  several 
harmonics.  See  the  discussion  in  this  manuscript  for  a  com¬ 
parison  of  these  designations  to  earlier  classification  sche¬ 
mata.  All  vocalizations  were  saved  as  separate  wav  files.  In¬ 
dividual  mm224  produced  42  combined  tonal/pulsed  signals 
and  4  whistles  and  mm226  produced  31  combined  tonal/ 
pulsed  signals  and  17  whistles. 

E.  Analysis  of  combined  tonal/pulsed  signals 

Pulses  were  located  automatically  using  customized 
Matlab  7.1  (The  MathWorks.  Inc.)  software  that,  given  a 
user-provided  threshold  value,  triggered  on  and  marked 
abrupt  peaks  in  the  pressure  waveform.  Subsequent  inspec¬ 
tions  of  all  waveforms  were  made  to  select  undetected  and 
remove  erroneously  marked  pulses.  A  nonparamctric  Wil- 
coxon  rank  sum  test  with  a  Bonferroni  correction  was  used 
to  examine  whether  the  four  parameters  of  average  IPI.  du¬ 
ration.  number  of  pulses,  and  pulse  repetition  rate  were  sig¬ 
nificantly  distinguishable  between  the  two  individuals.  The 
pulse  repetition  rate  and  the  normalized  pulse  number  were 
also  plotted  as  functions  of  the  normalized  duration  to  pro¬ 
vide  a  visual  means  of  comparing  these  sounds. 

F.  Whistle  extraction  and  analysis 

The  fundamental  frequency  contour  of  each  whistle 
spectrogram  (FFT  size  and  frame  length  of  2(>48  points  with 
50f/f  window  overlap)  was  traced  by  hand  with  customized 
Matlab  software  (Fig.  4).  One  hundred  equally  spaced  points 
were  extracted  from  these  contours  and  normalized  to  a  time 
axis  between  0  and  I  (see  Wutwood  el  al..  2(X)4.  2(X)5).  Two 
tests  of  similarity  were  conducted  on  these  whistle  contours: 
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RG  4  Digitized  tracc>  of  the  turulu- 
im-nul  Ircqucixy  conltiurs  of  ihc 
wIiivUcn  displayed  in  hip  3  Kach 
trace  in  shown  wnh  100  cquully 
spuvciJ  point*  ihul  have  been  normal 
ized  on  a  horizontal  time  axis  from  0 
l»i  I  Again.  panels  a-d  correspond  to 
the  whiNile*  produced  by  mm2  2  4  ami 
f-s  to  those  by  mm22b  Nine  the  dif¬ 
ference  in  t Ik*  frequency  ranges  lor  the 
two  mdiNiduaK 


1.  Nonparametric  comparison 

One  temporal  (original  duration  before  normalization) 
and  live  spectral  (minimum,  maximum,  mean,  initial,  and 
ending  frequencies)  features  were  determined  for  everv 
whistle  (Fig.  5).  These  parameters  were  selected  because 
they  summarized  the  timing  and  coarse  frequency  content  of 
the  whistles  The  differences  between  the  liner  aspects  of  the 


RG.  5.  Illustration  of  temporal  and  spectral  features  extracted  from  a  traced 
whistle  produced  hy  mm224  t  subplot  d  in  l  ips.  3  and  4i  The  initial  and 
ending  frequencies  arc  indicated  by  filled  circles  (•)  while  the  ocher  spectral 
components  are  marked  with  luvri/onial  dashed  lines.  Note  that  the  mean 
t requeues  is  closer  to  ilic  minimum  frequency  because  -MP*  of  ihc  who 
lie’s  frequency  content  lies  below  3700  Hz. 
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frequency  contours  were  reserved  for  the  cross-correlation 
comparison.  Again,  a  nonparametric  Wilcoxon  rank  sum  tesi 
w  ith  a  Bonlerroni  correction  was  used  to  compare  these*  lea 
lures  between  individuals. 

2.  Cross-correlation  comparison 

Cross-correlation  is  often  used  in  signal  processing  as  a 
tool  for  determining  the  similarity  Km  ween  two  signals  Be¬ 
cause  the  frequency  ranges  of  the  whistles  from  the  two  in¬ 
dividuals  were  distinct  (e  g.,  the  average  maximum  Ire 
quenev  of  mm22b  was  less  than  the  average  minimum 
frequency  of  mm224).  the  cross-correlation  measurement 
used  here  was  designed  to  account  for  overall  contour  shape 
and  absolute  frequency  content.  In  the  equation 

K/»  -/«(<))(/^i)  -/*  (ill! 

>  - ====== - - -  1 1 ) 


i  is  the  sample  number  that  ranges  between  I  and  100.  / ,(/) 
and  f/fU )  correspond  to  the  /th  frequency  value  of  contours 
A  and  B.  respectively,  and  fH  (i)  is  the  ith  frequency  value 
of  contour  H  after  it  has  been  slid  along  the  frequency  axis 
to  minimize  the  frequency  differences  Ktwccn  contours  A 
and  H.  These  terms  arc  illustrated  in  Fig.  (>.  Larger  values 
of  this  cross-correlation  measure  indicated  greater  differ¬ 
ences  between  contours  than  smaller  values.  A  value  of  0 
would  reveal  no  difference  at  all  in  contour  shape.  A  con 
strained,  nonlinear  minimization  routine  was  used  to  de¬ 
termine  /jj  (i).  The  first  term  in  the  product  of  the  numera¬ 
tor  of  ( I )  is  the  difference  between  points  along  the  actual 
contours  normalized  in  time  (Fig.  6(a)).  The  second  term, 
however,  returns  a  smaller  number  if  the  frequency  modu¬ 
lation  pattern  is  similar  between  the  whistles  regardless  of 
the  absolute  frequency  offset  of  the  two  [Fig.  f>(b)]. 
Whistles  therefore  could  have  achieved  a  higher  similarity 
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TABLE  II.  Summary  statistics  of  ihc  acoustic  feature*  of  whistles. 


FIG.  6.  Illustration  of  points  used  for  cross-correlation  comparison  of 
whistles  (see  text  lor  the  equation).  In  (a),  contour  A  (darker,  from  mm226: 
subplot  i  in  Figs.  3  ansi  4)  and  fi  (lighter,  from  mm224:  subplot  d  in  Figs.  3 
and  4i  are  depicted  normalized  in  lime  wiih  their  original  frequency  content. 
In  (h).  contour  H  has  been  shifted  along  the  frequency  axis  to  minimize  the 
frequency  difference  between  the  two  contours.  All  l(K)  points  along  the 
contours  were  used  to  compute  Hq.  ( 1 ). 


ranking  (a  smaller  value  of  ( I )]  by  overlapping  in  absolute 
frequency,  possessing  similar  overall  contour  shapes  or 
both. 

III.  RESULTS 

Tables  1  and  II  lisi  the  summary  measurements  of  the 
combined  tonal/pulsed  signals  and  whistles.  With  the  excep¬ 
tion  of  a  single  whistle  assigned  to  mm226  with  a  SNR  of 
13.9  dB.  the  remaining  vocalizations  produced  by  both  ani¬ 
mals  were  characterized  by  a  SNR  of  at  least  28.2  dB.  Com¬ 
bined  tonaly pulsed  signals  were  produced  throughout  the  wa¬ 
ter  column  but  tended  to  concentrate  at  particular  depths 
{roughly  70  m  for  mni224  and  20  m  for  mm226.  Fig.  7). 
Whistle  production  occurred  between  20  and  100  m  for 
nim224  but  was  confined  to  the  upper  30  m  for  mm226  (Fig. 
7).  Both  vocal  categories  were  recorded  throughout  the  div¬ 
ing  sequence,  indicating  that  the  behavioral  or  environmental 
contexts  in  which  these  vocalizations  occurred  were  not  gen- 


TABLE  I.  Summary  statistics  of  the  acoustic  features  of  combined  tonal/ 
pulsed  signals. 


Mean 

Std. 

Min. 

Max 

mm224. 

ns  42 

Duration  <s) 

1.6 

0.7 

0.6 

2.7 

Average  IPI  (ms) 

12.9 

4.3 

9.0 

36.1 

Number  of  pulses 

I2K.I 

45.4 

49.0 

202.0 

Pulse  repetition  rate  (pulses/s) 

82.3 

14.2 

28.1 

II2.K 

mrn226. 

n«3l 

Duration  (s) 

1.2 

0.1 

1.0 

1.2 

Average  IPI  tms) 

6.3 

0.3 

5.6 

6.8 

Number  of  pulses 

186.2 

9.6 

156.0 

201  0 

Pulse  repetition  rate  (pulses/s) 

160.8 

7.4 

147.5 

180.5 

Mean 

Std. 

Min. 

Mux. 

Duration  (x) 

mm224.  n  =  4 

1.19 

0.08 

1.09 

1.26 

Minimum  frequency  (Hz I 

1549 

201 

1292 

1775 

Maximum  frequency  (Hz) 

7181 

1386 

5460 

8844 

Mean  frequency  (Hz) 

3638 

370 

3405 

4190 

In  dial  frequency  (Hz) 

4773 

277 

4496 

5145 

Ending  frequency  (Hz' 

1572 

216 

1292 

I7K8 

Duration  <x) 

mm226.  n  s  1 7 

0.78 

0.04 

0.6S 

0.85 

Minimum  frequency  (Hz) 

718 

156 

360 

980 

Maximum  frequency  (Hz) 

1177 

III 

1095 

1501 

Mean  frequency  (Hz) 

1012 

71 

895 

1240 

Initial  frequency  (Hz) 

939 

236 

641 

150) 

Ending  frequency  (Hz) 

1160 

90 

1095 

I486 

erally  restricted  to  a  very  narrow  depth  or  time.  The  two 
animals  responded  differently  immediately  after  handling. 
Many  combined  pulsed/tonal  signals  (17  of  42)  and  one 
whistle  were  produced  by  individual  mm224  just  after  re¬ 
lease  on  his  first  dive  lasting  only  10.8  min.  Individual 
mm226  was  vocally  active,  however,  between  hours  4  and  10 
of  the  deployment  where  he  reached  a  maximum  depth  of 
about  125  m  (data  not  shown).  He  did  not  produce  any  com¬ 
bined  lonal/pulscd  signals  or  whistles  for  the  first  24  dives 
that  exceeded  roughly  10  m  following  his  release,  a  response 
more  closely  resembling  the  silent  reaction  observed  and  dis¬ 
cussed  by  Finley  et  ul.  ( 1990)  of  narwhals  exposed  to  envi- 


a  b 


fraction  of  total  time 

FIG.  7.  Appro m i rule  depths  where  combined  tonal/pulsed  signals  (triangles. 
A)  and  whistles  (circles.  O)  were  produced  adjacent  lo  u  frequency  histo¬ 
gram  of  depth  bins  (bars)  lor  mni224  (a)  and  i»m226  (b).  the  frequency 
plotted  on  the  ahscivsa  is  expressed  as  a  fraction  of  the  total  amount  of  time 
spent  at  all  depths  the  maximum  depths  achieved  for  mm224  and  mm2 2k 
during  the  DTAG  deployments  were  roughly  125  and  210  m.  respectively. 
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mm2  24 
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MO  K  Visual  a'prvvcntaiMWis  i*f  lent 
poral  features  «*l  combined  tonal/ 
pulseil  signals.  Normalized  pulse  num¬ 
ber  flop  two  panels)  and  pulse 
repetition  rale  (pulses  per  second,  hoi- 
toin  two  panels)  as  a  function  of  nor- 
mali/cd  duration  Note  tlx*  clear  differ¬ 
ences  in  general  morpholoev  of  these 
plots  between  lt*c*  two  mdisiduaK 


ronmcnial  disturbances.  Whistles  were  less  common  lhan 
combined  tonal/pulscd  signals,  as  reported  in  earlier  studies 
(Ford  and  Fisher.  1978;  Miller  el  ai.  1995). 

The  combined  tonal/pulsed  signals  lasted  between  0.55 
and  2.68  s  and  contained  between  49  and  202  pulses.  Spec¬ 
trograms  revealed  the  synchronous  production  of  both  pulsa¬ 
tile  energy  in  the  form  of  repeated  broadband  impulses  and  a 
tonal.  FM  component  by  the  tagged  animals  (Fig.  2).  The 
FM  component  was  not  an  analytical  artifact  of  the  pulsatile 
energy  (see  Watkins.  1967)  because  the  fundamental  fre¬ 
quency  of  the  tonal  feature  was  inconsistent  with  the  repeti 
lion  rate  of  the  pulses.  It  is  likely  that  at  least  two  sound 
generating  apparatuses  are  required  to  produce  these  com¬ 
bined  tonal/pulsed  signals  to  achieve  pulsatile  and  FM  en¬ 
ergy  content  simultaneously.  Combined  tonal/pulscd  signals 
were  characterized  by  pulse  rates  between  28  and 
I  13  pulses/ s  for  mm224  and  between  148  and  180  pulses/s 
for  mm226.  Figure  2  reveals  additional  low  frequency  en¬ 
ergy  associated  with  each  pulse,  which  is  likely  the  conse¬ 
quence  of  resonance  of  the  air  sacs  of  the  tagged  animal  and 
not  reverberations  from  or  echoes  off  of  elements  along  the 
inlet  bottom.  Indeed,  no  echoes  consistent  w  ith  target  local¬ 
ization  or  monitoring  position  in  the  water  column  were  de¬ 
tected  in  the  audio  record.  All  whistles  were  between  0.68 
and  1 .26  s  w  ith  frequencies  ranging  between  360  and 
8844  Hz.  Three  of  the  four  whistles  produced  by  mm224 
were  characterized  by  a  brief  (0.1 8-0.24  s)  upsweep,  fol¬ 
lowed  by  a  pause  and  longer  (0.62  -0.66  s)  downsweep 
[Figs.  3(a),  3(b),  and  3(d)].  The  fourth  whistle  was  continu¬ 
ous  but  still  showed  an  upsweep  preceding  the  downsweep 
[Fig.  3(c)].  All  seventeen  whistles  produced  by  mm226  con¬ 
tained  a  brief  (0.026  -  0.091  s)  broadband  segment  with  en¬ 
ergy  that  peaked  between  about  500  and  700  Hz  and  then 
decayed  steadily  until  disappearing  above  8- 10  kHz  fol¬ 
lowed  by  a  flat,  constant  frequency  tone  that  lasted  for  the 
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remainder  of  the  whistle  [Figs.  3(0-3(s).  3  whistles  are  not 
shown].  The  whistles  were  all  of  about  the  same  intensity 
except  for  a  quieter  one  recorded  on  the  tag  attached  to 
mm226  [Fig.  3(j)].  which  may  have  been  softer,  may  have 
been  produced  by  a  non  focal  animal  located  further  from  the 
lag.  or  may  have  had  a  transmission  path  that  was  partially 
obscured  by  the  animal  or  tag  components  before  reaching 
the  hydrophone. 

The  uniqueness  of  each  of  the  two  sets  of  combined 
tonal/pulscd  signals  and  whistles  was  apparent  from  simple 
visual  inspection.  Among  the  combined  tonal/pulscd  signals 
the  patterns  of  how  the  relative  timing  and  repetition  rate  of 
the  pulses  varied  as  a  function  of  normalized  duration  dif¬ 
fered  between  the  two  individuals  (Fig.  8).  All  measured 
features  for  both  the  combined  tonal/pulscd  signals  and 
whistles  differed  significantly  between  the  two  animals  (VYil 
coxon  rank  sum  test  with  a  Bonferroni  correction.  /*  =  ()  (K)2 
for  combined  tonal/pulsed  signals  duration.  P<  0.(X)l  for  re¬ 
maining  combined  tonal/pulsed  signals  measurements.  P 
=  0.011  for  whistle  ending  frequency  and  /*=0.008  for  re¬ 
maining  whistle  measurements).  The  whistles  produced  by 
mm224  were  longer  and  higher  in  every  measurement  com¬ 
pared  to  those  belonging  to  mm226,  which  were  shorter  and 
lower.  Indeed,  the  minimum  frequency  of  mm224  was 
I549±20l  Hz  (.v±vd)  and  the  maximum  frequency  of 
mm226  was  1 1 77 ±111  Hz  (.v±sd).  values  that  did  not  over¬ 
lap  even  a  single  standard  deviation  away  (Table  Ilf  The 
cross-correlation  test  on  the  whistles  revealed  dramatic  dif¬ 
ferences  for  the  interindividual  comparisons  (between 
mm224  and  mm226)  and  only  slight  differences  among  the 
intraindividual  comparisons  (Table  111).  The  interindividual 
results  were  more  different  than  the  intraindividual  results  by 
1-2  orders  of  magnitude. 
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TABLE  III.  Cross-corrclalKHi  comparison  of  whittles  between  the  same  anti 
different  individual*.  These  data  were  computed  in  arbitrary  units  with 
higher  value*  indicating  a  greater  difference  between  the  contours  being 
compared.  The  intraindividual  comparisons  are  italicized. 


min226 

mm224 

mm224 

127  250 

s  m 

mm22f> 

716 

IV.  DISCUSSION  AND  CONCLUSIONS 

Two  free-ranging  narwhals  each  produced  an  acousti¬ 
cally  distinctive  set  of  combined  tonal/pulsed  signals  and 
whistles.  Visual  and  aural  inspection  and  nonparumeiric  and 
cross-correlation  analyses  all  demonstrated  striking  interindi¬ 
vidual  differences  among  these  vocalizations  and  intraindi¬ 
vidual  temporal  and  spectral  fidelity.  These  results  support 
the  claim  of  Ford  and  Fisher  (1978)  that  narwhals  produce 
individually  distinctive  signature  vocalizations.  The  record¬ 
ings  analyzed  here  also  contained  numerous  faint  combined 
tonal/pulsed  signals  and  whistles  produced  by  nonlocal  ani¬ 
mals.  These  observations  are  consistent  with  the  conclusion 
that  these  vocal  categories  are  regularly  produced  by  free- 
ranging  narwhals  in  this  area. 

The  function  of  these  vocalizations  remains  uncertain, 
but  they  do  not  appear  to  facilitate  foraging.  When  feeding, 
some  odontocetes  produce  a  sequence  of  regularly  spaced 
echolocation  clicks  that  precede  a  buzz,  or  a  series  of  clicks 
characterized  by  a  dramatically  elevated  repetition  rate  [e  g., 
sperm  whales  ( Phxseter  macmcephalus):  Miller  et  al.  2(X)4. 
Blainvillc's  beaked  whales  ( Mesoplodon  den  si  rust  r is):  Mad¬ 
sen  et  al.  2005].  It  seems  unlikely  that  the  narwhal  vocal¬ 
izations  quantified  here  were  used  for  foraging  purposes 
since  no  echolocation  clicks  were  delected  immediately  be¬ 
fore  the  combined  tonal/pulsed  signals  or  the  whistles.  In¬ 
deed,  the  kind  of  clicking  behavior  characterized  by  changes 
in  repetition  rate  and  amplitude  that  is  associated  with  forag¬ 
ing  has  been  recorded  from  narwhals  in  previous  studies  (see 
Mohl  et  al..  1990;  Miller  et  al ..  1995).  but  was  not  observed 
here. 

Combined  tonal/pulsed  signals  and  whistles  may  play  a 
role  in  social  communication  based  on  their  stereotypy  (Ford 
and  Fisher.  1978)  and  the  signature  content  shown  by  the 
limited  dataset  presented  here.  These  distinctive  vocaliza¬ 
tions  might  serve  ;ts  contact  calls  to  facilitate  reunions  of 
individuals  with  their  group  members  in  a  manner  similar  to 
that  observed  in  captive  and  free-ranging  bottlenosc  dolphins 
(Janik  and  Slater,  1998.  Watwood  et  al.  2005).  Unlike  the 
studies  conducted  with  bottlenose  dolphins  to  identify  pair 
bonds  or  alliances  among  males  (Connor  et  al.  1992.  2001; 
reviewed  in  Wells.  2003),  little  work  has  been  completed  to 
describe  the  social  structure  and  group  relationships  among 
individual  narwhals.  Based  on  personal  observations,  the 
narwhals  entering  Admiralty  Inlet  traveled  in  groups  ranging 
in  size  from  roughly  5  to  30  animals.  The  group  members 
traveling  with  the  tagged  animals  vacated  the  area  while  their 
companions  were  detained  on  shore.  The  vocalizations  of 
more  distant  animals  that  were  recorded  in  this  study  were 
usually  faint,  suggesting  that  for  the  tagged  animals,  the 
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dives  occurring  after  their  capture  were  likely  solitary  events, 
in  addition,  no  other  animals  appeared  in  accompanying 
video  footage  recorded  from  a  Crittercam  (with  a  visual 
range  extending  between  3  and  20  m  depending  on  the  light 
level)  attached  to  mm226  during  the  first  hour  following  its 
release.  These  observations  are  consistent  with  the  hypoth¬ 
esis  that  the  combined  lonal/pulsed  signals  and/or  whistles 
were  used  by  these  two  narwhals  as  contact  calls  in  an  effort 
to  regain  contact  w  ith  their  groups.  However,  actual  reunions 
with  other  animals  were  not  obvious  from  the  acoustic 
record. 

Because  these  two  animals  were  likely  members  of  dif¬ 
ferent  groups  traveling  into  Admiralty  Inlet,  an  alternative 
explanation  consistent  with  the  results  presented  here  is  that 
these  vocalization  classes  may  have  been  distinctive  at  the 
level  of  the  social  group  (see  Terhunc  et  al.  2001;  WeiB  et 
al,  2006).  Another  possibility  is  that  combined  tonal/pulscd 
signals  and  whistles  are  actually  used  as  signature  vocaliza¬ 
tions  to  cue  conspecifics  about  individual  identity.  The 
whistle  of  mm226  was  characterized  by  nearly  constant  fre¬ 
quency  except  for  the  brief  noisy  segment  at  the  very  begin¬ 
ning.  From  an  information  theory  perspective,  a  Hat  whistle 
encodes  less  information  compared  to  a  frequency  modulated 
whistle.  It  is  possible,  however,  that  even  flat  whistles  of 
consistently  distinctive  durations  or  pilches  could  be  used  to 
distinguish  between  individuals.  The  contours  of  the  whistles 
of  mm226  appeared  very  similar  to  the  FM  component  of  the 
combined  tonal/pulsed  signals  of  this  animal.  The  whistles 
produced  by  mm224.  however,  contained  the  frequency 
modulation  expected  of  signature  vocalizations  and  did  not 
resemble  this  animal’s  combined  tonal/pulsed  signals  FM 
component. 

The  combined  tonal/pulsed  signals  described  here  most 
closely  resembled  the  longer  click  scries  and  the  pulsed 
sounds  described  by  Watkins  et  al  (1971)  and  Ford  and 
Fisher  (1978),  respectively.  All  of  these  vocalizations  were 
characterized  by  a  combination  of  pulses  and  a  tonal  signal. 
Watkins  et  al.  (1971)  described  the  repetition  iate  of  their 
longer  click  series  tending  to  increase  before  becoming  con¬ 
stant  and  eventually  slowing  down,  somewhat  similar  to  the 
trend  observed  in  the  combined  tonal/pulsed  signals  de¬ 
scribed  here  (Fig.  8).  In  both  the  combined  tonal/pulsed  sig¬ 
nals  recorded  in  this  study  and  their  equivalents  described  in 
Watkins  et  al  (1971)  and  Ford  and  Fisher  (1978).  the 
synchronously- produced  FM  component  creating  the  tonal 
quality  in  those  sounds  was  not  due  exclusively  to 
harmonically-related  sidebands  of  the  repetition  rate  (see 
Watkins.  1967).  Note  in  Fig.  2  that  the  pitch  of  the  FM 
component  docs  not  always  correspond  to  the  repetition  rate 
of  the  pulses.  For  the  combined  lonal/pulsed  signals  pro¬ 
duced  by  mm226.  for  example,  the  FM  component  begins 
over  halfway  through  the  signal  without  any  observable 
change  in  repetition  rate.  Also,  the  tonal  energy  persists  even 
as  the  pulses  slow  down  at  the  end  of  the  combined  tonal/ 
pulsed  signals  attributed  to  individual  mm224  [Fig.  2(a)]  and 
continues  beyond  the  conclusion  of  the  pulses  in  the  signal 
assigned  to  mm226  [Fig.  2(b)].  The  pulsatile  component  of 
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the  combined  tonal/pulsed  signals  quantified  here  was  char¬ 
acterized  by  a  higher  upper  frequency  limit  (up  to  48  kHz.) 
compared  to  earlier  recordings. 

Previous  studies  (Watkins  et  al..  1971:  Ford  and  Fisher. 
1978)  reported  observing  tonal  signals  with  properties  that 
were  both  similar  to  and  different  from  the  whistles  analyzed 
here.  The  frequency  ranges  overlapped  but  the  whistles  that 
were  recorded  here  had  higher  harmonics,  extending  the  up¬ 
per  bound  of  these  tonal  vocalizations  to  at  least  48  kHz 
Earlier  published  tones  were  either  constant  in  frequency  or 
swept  upwards  or  downwards,  again  consistent  with  the 
whistles  presented  here  (Fig.  3).  Both  earlier  works,  how¬ 
ever.  described  whistles  as  narrow-band  signals  lacking  ad¬ 
ditional  detectable  harmonic  content.  All  of  the  whistles  re¬ 
corded  for  this  study  were  typified  by  a  fundamental  FM 
component  and  harmonic  energy,  a  difference  which  may 
have  resulted  from  the  higher  sampling  rate  of  the  recording 
equipment  and/or  the  elevated  signal  to  noise  ratio  due  to  the 
close  proximity  of  the  hydrophone  to  the  whale 

The  cross-correlation  test  used  in  this  analysis  was 
modified  slightly  from  those  described  in  other  studies  of 
signature  vocalizations  (e.g..  Buck  and  Tyack.  1993;  Mc¬ 
Gowan.  1995.  Janik.  1999;  Watwood  et  al..  2005).  In  gen¬ 
eral.  similarity  between  whistle  contours  can  result  either 
coarsely  from  a  general  overlap  in  frequency  range  and/or 
more  finely  from  comparable  frequency  modulation  (e.g.. 
loop  number,  overall  shape).  The  time- in  variant  cross- 
correlation  test  used  here  incorporated  both  of  these  compo¬ 
nents  into  its  final  measurement.  A  continuum  was  possible 
ranging  from  dissimilar  (minimal  frequency  overlap  and  con¬ 
tour  resemblance)  to  very  similar  (maximal  frequency  over¬ 
lap  and  contour  resemblance).  Because  the  first  term  of  the 
product  ranked  similarity  according  to  both  overlap  and  con¬ 
tour  and  the  second  term  according  to  contour  only,  interme¬ 
diate  scores  of  similarity  were  also  possible.  This  was  par¬ 
ticularly  important  since  the  frequency  ranges  of  the  w  histles 
from  the  two  narwhals  were  mostly  nonoverlapping.  The 
possibility  of  contour  shape  resemblance  was  excluded  by 
the  unambiguous  results  of  the  cross-correlation  test 
(Table  III). 

These  findings  suggest  possible  directions  for  future 
work  Tagging  and  recording  the  combined  tonal/pulsed  sig¬ 
nals  and  whistles  of  multiple  narwhals  from  other  groups 
would  provide  data  that  could  support  or  reject  the  conclu¬ 
sions  made  here.  If  these  signals  do  possess  signature  con¬ 
tent.  further  study  could  ascertain  whether  they  are  distinc¬ 
tive  at  the  individual  or  group  level.  Critical  to  determining 
the  function  of  these  vocalizations  w  ill  be  an  assessment  of 
the  behavioral  context  in  which  these  vocalizations  are  pro¬ 
duced  and  the  stability  of  group  composition  and  size  over 
short  and  long  time  scales.  If  narwhals  are  capable  of  difler- 
entiating  between  individuals  acoustically,  quantifying  these 
aspects  of  group  dynamics  would  provide  starting  estimates 
for  the  number  of  animals  w  ith  which  a  single  individual  is 
interacting  and  therefore  between  which  it  should  be  able  to 
distinguish.  Playback  experiments  would  be  useful  for  iden¬ 
tifying  the  temporal  and  spectral  features  of  the  combined 
tonal/pulsed  signals  and  whistles  that  the  animals  may  be 
using  to  facilitate  differentiation.  An  understanding  of  the 


ontogeny  of  these  sounds  to  determine  if  vocal  learning  plays 
any  role  in  their  acquisition  or  development  requires  acoustic 
data  from  the  same  animals  and  their  groups  collected  lon¬ 
gitudinally  over  many  years. 

Signature  whistles  appear  to  be  used  by  bottlenose  dol¬ 
phins  as  contact  calls  in  a  variety  of  contexts  (Janik  and 
Slater,  1998;  Suyigh  et  al..  1998;  Watwood  et  al..  2005).  If 
narwhals,  another  gregarious  odontocete.  similarly  use  their 
combined  tonal/pulsed  signals  and  whistles  as  contact  calls 
when  separated  from  conspecdics.  the  procedure  described 
here  affords  an  opportunity  to  make  recordings  in  ihiv  con¬ 
text  while  the  animals  are  detained  ashore.  Under  this  hy¬ 
pothesis,  an  involuntary  separation  of  the  sort  imposed  here 
would  cause  the  animals  and/or  their  group  members  to  vo¬ 
calize  in  an  effort  to  regain  contact. 

Despite  the  small  sample  size,  the  data  presented  in  this 
manuscript  provide  supportive  evidence  for  at  least  two 
classes  of  signature  vocalizations  among  free-ranging  nar¬ 
whals  at  the  individual  oi  group  level.  Future  work  focused 
on  the  ontogeny,  function,  and  acoustic  characteristics  of  the 
combined  tonal/pulsed  signals  and  whistles  produced  by  nar¬ 
whals  is  required  to  develop  an  improved  understanding  of 
the  vocal  and  social  behavior  of  this  elusive  Arctic  animal. 
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Appendix  5.  Testing  for  Orientation  Responses  of  Individual 
Sperm  Whales  to  a  Varying  Sonar  Exposure  Level* 

A5.1  Abstract 

Research  examining  responses  of  marine  mammals  to  sound  stimuli  is  often  limited 
by  small  sample  sizes  or  the  reporting  of  a  single  exposure  level  for  each  subject,  which  has 
restricted  analysis  of  dose-response  functions.  Here  we  report  on  a  statistical  technique  for 
titrating  a  continuous  behavioral  response  parameter  against  a  range  of  sound  exposure 
levels  for  each  subject.  We  analyzed  the  angular  orientation  responses  of  three  tagged  sperm 
whales  with  respect  to  a  sonar  source  operating  between  2-15  kHz  as  a  function  of  a  varying 
received  sound  exposure  level.  During  each  experiment,  the  source  level  was  gradually 
increased,  or  ramped  up,  and  then  maintained.  A  method  accounting  for  serial  correlation 
and  based  on  circular  regression  was  used  to  test  the  null  hypothesis  of  no  effect  of  received 
level  on  angular  orientation  for  two  whales.  Our  analysis  did  not  find  a  significant  effect  for 
approach  towards  or  avoidance  of  the  source  as  a  function  of  received  level,  90%  of  which 
ranged  from  106-137  dB  re  lpPa2s.  This  statistical  technique  proved  robust  to  test  for 
avoidance  responses  more  generally,  underscoring  its  utility  for  empirical  evaluation  of  the 
assumption  that  animals  will  avoid  harmful  exposures  during  ramp  up  or  a  sound  source 
approach. 

A5.2  Introduction 

The  extent  to  which  marine  mammals  are  influenced  by  anthropogenic  sound 

This  manuscript  has  been  submitted  to  Marine  Mammal  Science  for  publication  with  Peter  L.  Tvack  and 
Andrew  R.  Solow  as  co-authors. 
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sources,  including  sonars  used  to  detect  and  locate  objects  underwater,  is  a  contendous  issue. 
Marine  mammals  have  evolved  mechanisms  to  use  sound  to  communicate  and  orient  in  the 
sea,  whose  physical  properdes  favor  sound  for  rapid  long-distance  communication.  Over  the 
past  century,  humans  have  had  global  impacts  on  the  ocean  acoustic  environment.  Ships 
have  elevated  the  deep  ocean  ambient  noise  by  10-100  fold  and  naval  sonar  exercises  have 
been  found  to  cause  some  whales  to  strand  and  die  (Cox  et  al.,  2006).  The  largest  problem 
for  managing  the  risks  of  sound  involves  our  ignorance  of  its  effects  on  marine  mammals. 

Part  of  the  difficulty  in  measuring  these  effects  analytically  lies  in  defining  the 
qualitative  and  quantitative  nature  of  possible  responses  and  in  determining  their  spatial  and 
temporal  extent.  Individual  and  species  variability  and  a  wide  diversity  of  stimulus  categories, 
exposure  levels,  and  patterns  of  presentation  contribute  to  the  complexity  of  these  analyses. 
Controlled  exposure  experiments  (CEEs),  however,  have  proven  useful  in  addressing 
conservation  concerns  by  describing  dose-response  relationships  for  the  behavioral 
responses  of  animals  to  acoustic  exposure  (Tyack  et  al.,  2003).  In  a  CEE,  the  behavioral 
response  of  one  or  more  focal  individuals  is  monitored  over  time  during  exposure  to  a 
stimulus  with  acoustic  characteristics  that  are  varied  in  a  controlled  fashion.  Most  earlier 
CEEs  have  associated  one  received  level  with  a  behavioral  response  but  given  the  limited 
number  of  subjects  typically  available,  this  has  hindered  the  development  of  dose-response 
functions.  There  is  a  need  to  develop  CEE  protocols  and  associated  analyses  that  allow  a 
rigorous  testing  of  how  behavioral  response  parameters  vary  over  a  range  of  acoustic 
exposure  for  each  subject. 

One  might  expect  animals  to  avoid  an  aversive  stimulus  by  moving  away  from  the 
source.  Many  acoustic  mitigation  protocols  gradually  increase,  or  ramp  up,  an  anthropogenic 
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sound  stimulus  from  the  lowest  level  practicable  to  its  full  operating  intensity.  The  most 
common  goal  of  this  procedure  is  to  allow  an  exposed  animal  the  opportunity  to  detect  the 
sound  at  safe  exposure  levels  and  move  away  from  the  zone  of  potential  injurv  near  the 
source,  as  demonstrated  for  certain  species  of  baleen  whales  (Malme  et  al.,  1984;  McCauley 
et  al.,  2000;  Richardson  et  al.,  1990).  However,  it  is  also  possible  that  ramp  up  may  make  it 
more  likely  for  the  animal  to  habituate  to  the  sound  and  remain  in  the  area.  Another  goal  of 
ramp  up  could  be  to  habituate  animals  to  particular  signals  at  levels  below  which  a 
potentially  risky  response  may  be  evoked.  Our  analysis,  which  was  designed  to  examine 
whether  subjects  showed  avoidance  responses  as  a  function  of  their  acoustic  exposure,  is 
ideal  for  testing  whether  animals  will  remain  in  or  vacate  an  area  when  introduced  to  sounds 
of  steadily  increasing  level,  and  to  define  the  received  sound  exposure  level  at  which  this 
response  occurs. 

This  paper  presents  a  new  analysis  method  aimed  at  identifying  a  response  in  the 
angular  orientation  of  individual  marine  mammals  to  a  sound  source.  The  method  was 
applied  to  data  collected  from  mid-frequency  sonar  tests  on  tagged  sperm  whales  ( Phjseter 
macrocephalus).  Previous  opportunistic  observations  of  sperm  whale  behavior  in  the  presence 
of  sonars  or  pingers  (Watkins  &  Schevill,  1975;  Watkins,  1977;  Watkins  et  al.,  1985;  1993) 
suggested  that  avoidance  responses  may  occur,  but  did  not  involve  rigorous  analyses.  The 
cruises  described  here  have  been  used  for  several  studies:  to  test  whether  mid-frequencv 
sonar  could  detect  sperm  whales;  to  define  the  3D  beam  pattern  of  sperm  whale  clicks 
(Zimmer  et  al.,  2003;  2005);  and  to  determine  whether  the  behavior  of  the  whales  was 
affected  by  the  received  sound  exposure  level  (RL)  of  the  sonar  (this  paper).  Given  the  low 
sample  sizes  that  often  accompany  studies  on  the  effects  of  anthropogenic  noise  on  whales, 
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the  analysis  presented  here  benefits  from  evaluating  behavioral  responses  of  each  individual 
subject  against  a  range  of  acousdc  RLs,  using  each  presentation  of  the  sonar  stimulus  as  a 
data  point.  The  method  addresses  serial  dependence  in  the  dataset  with  an  approach  that 
considers  the  continuous  orientation  response  of  each  sperm  whale  subject  to  varying  RLs 
of  mid-frequency  sonar  when  the  source  was  on  and  of  the  background  noise  level  when  the 
source  was  off. 

Previous  experiments  of  this  type  have  often  taken  the  individual  subject  as  the  unit 
of  analysis  and  utilized  stationary  sources  (Malme  et  al.,  1984)  or  a  relatively  straight  line 
approach  of  the  source  (Miller  et  al.,  2000;  Nowacek  et  al.,  2004).  These  analyses  extracted  a 
single  measurement  for  each  animal  (e.g.,  the  closest  point  of  approach)  as  a  function  of  the 
maximum  RL  at  the  whale  (Richardson  et  al.,  1990).  The  method  presented  here,  however, 
examined  the  possibility  of  a  relationship  between  orientation  response  and  RL  for  an 
individual  animal  whose  relative  position  with  respect  to  the  vessel  was  changing  throughout 
the  exposure.  The  sonar  source  was  moved  in  a  variable,  circling  path  around  the  animal  as  it 
swam,  covering  the  full  360°  several  times  over  during  each  experiment.  Our  analysis  took 
advantage  of  exposing  a  whale  that  could  change  its  orientation  with  respect  to  the  vessel 
much  more  rapidly  than  the  ship  was  moving. 

Any  method  that  uses  a  data  series  from  the  same  subject  must  consider  possible 
serial  correlation  between  consecutive  measurements.  Animal  movements  can  be  serially 
dependent  (especially  for  a  large  whale  over  the  short  intervals  of  1 5s  between  sonar  pings) 
because  the  direction  of  an  individual  at  time  t  can  often  be  predicted  fairly  well  by  its 
orientation  and  position  at  time  t—  1.  The  sequence  of  RLs  was  also  characterized  by  serial 
dependence.  The  rotation  test  used  here  to  assess  significance  controlled  for  serial 
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dependence  in  both  data  streams  by  examining  the  dependence  of  the  sequence  of  angular 
orientation  data  on  the  rotated  sequences  of  RL  data,  preserving  anv  serial  dependence 
across  each  rotation.  More  specifically,  only  the  relative  timing  -  and  not  the  internal  order  - 
of  the  orientation  and  RL  data  series  was  altered  between  rotations,  maintaining  the  serial 
dependence  of  each  data  stream. 

A5.3  Materials  and  methods 
A5.3. 1  Experimental  design 

A  5. 3.1 .1  Field  protocol 

Three  exposures  of  mid-frequency  sonar  were  conducted  with  adult  male  sperm 
whales  from  2001-2003  in  the  Ligurian  Sea,  Italy  using  a  quiet  research  vessel  (R/V  Alliance). 
A  cantilevered  pole  mounted  on  a  rigid  hull  inflatable  boat  was  used  to  attach  a  digital 
archival  tag  (DTAG)  to  a  focal  sperm  whale  (Moore  et  al.,  2001;  Johnson  &  Tyack,  2003). 
The  DTAG  contained  a  hydrophone,  depth  sensor,  and  a  tri-axial  accelerometer  and 
magnetometer,  which  all  recorded  to  flash  memory  (Johnson  &  Tvack,  2003,  see  Table  A5.1 
for  sampling  rates).  After  tagging,  the  vessel  circled  the  focal  animal  for  the  remainder  of  the 
experiment,  maintaining  a  distance  between  150  m  and  4.5  km.  Visual  observers  recorded 
the  bearing  and  range  to  the  focal  animal  during  surfacings  using  reticule  estimates  from 
Fujinon  Big  Eye  25x150  binoculars.  Reticule  numbers  were  converted  to  range  with  Gratran 
1.0.0  (©  J.C.  Gordon)  using  published  calibration  factors  (Kinzev  &  Gerrodette,  2001)  and 
equations  (Gordon,  1990;  Lerczak  &  Hobbs,  1998). 

After  the  whale  was  tagged,  a  pre-exposure  control  period  preceded  the  initiation  of 
exposure  to  sonar  sounds  broadcast  from  an  omnidirectional  source  (Table  A5.1).  The 
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yeaf 

2001 

2002 

2003 

calendar  day 

2  October 

10  July 

8  September 

experiment  duration  (h) 

6.93 

5.12 

7.61 

exposure  start  (h  into  experiment) 

1.01 

2.15 

2.09 

exposure  end  (h  into  experiment) 

3.14 

4.88 

5.66 

duration  of  exposure  (h) 

2.13 

2.73 

3.58 

number  of  pauses  (>  4  min) 

2 

3 

1 

total  length  of  pauses  (h) 

0.66 

0.93 

0.08 

sampling  rate:  hydrophone  (kHz) 

32 

32 

96 

sampling  rate:  movement  sensors  (Hz) 

5.88 

5.88 

5 

sonar  ping  mean  frequencies  (kHz) 

2.6,  3.8,  8.0,  15.0 

2.0,  3.0 

2.0,  3.0 

interval  between  pings  (s) 

15 

15 

15 

input  sonar  ping  duration  (s) 

0.1 

0.1 

0.4 

number  of  complete  dives 

8 

5 

8 

maximum  dive  depth  (m) 

899.6 

1171.0 

903.9 

Table  A5.1.  Descriptive  measurements  made  for  each  experiment.  The  italicized  sonar  ping  frequency  in  2001 
was  near  the  Nyquist  frequency  of  16  kHz.  Because  it  was  not  possible  to  measure  the  RLs  of  these  pings 
reliably,  they  were  excluded  from  the  analysis. 


source  level  (SL)  was  increased  gradually  from  1 50  dB  re  1  pPa  RMS  and  was  operated  to 
maintain  a  received  sound  level  (RL)  at  the  whale  below  160  dB  re  IpPa  RMS,  following  the 
conditions  of  the  research  permit  and  selected  to  be  well  below  that  considered  to  pose  a 
risk  of  injury7.  RL  was  approximated  by  modeling  transmission  loss  for  the  esdmated  range 
to  the  whale.  Sperm  whales  usually  begin  producing  regular  echolocation  clicks  soon  after 
they  dive  but  cease  during  ascent  and  while  at  the  surface  (Madsen  et  al.,  2002;  Miller  et  al., 
2004;  Watwood  et  al.,  2006).  A  hydrophone  array  was  used  to  track  the  tagged  animal  as  it 
was  clicking  (Zimmer  et  al.  2005).  The  sonar  was  only  active  when  the  location  of  the  animal 
was  known  sufficiently  to  ensure  the  exposure  did  not  exceed  the  permitted  limit.  In  2001 
and  2002,  the  sonar  was  turned  off  when  the  animal  stopped  clicking  as  it  started  its  ascent, 
leading  to  pauses  in  the  exposure  exceeding  4  minutes.  In  2001,  the  4  frequencies  were 
repeated  in  a  fixed  sequence  while  in  2002  and  2003,  the  2  and  3  kHz  pings  were  alternated. 
The  interval  between  pings  was  15s  for  all  exposures.  After  exposure,  the  vessel  continued  to 
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circle  the  tagged  whale.  The  DTAG  was  programmed  to  release  from  the  sperm  whale  and  a 
VHF  tracking  beacon  was  used  to  recover  the  tag  for  data  offloading. 

A 5. 3.1 .2  Received  sound  exposure  level  (RLJ  calculations 

The  RL  of  each  sonar  ping  was  calculated  from  the  DTAG  recordings, 
corresponding  to  the  approximate  level  encountered  by  the  tagged  animal.  Echolocation 
clicks  and  buzzes  (Miller  et  al.,  2004)  of  the  focal  or  neighboring  non-focal  animals  coincided 
with  some  of  the  pings.  Because  these  clicks  often  had  sound  pressure  levels  that  were 
substantially  higher  than  the  sonar  pings,  they  were  removed  using  Adobe  Audition  (Adobe 
Systems,  Inc.)  to  prevent  them  from  interfering  with  the  computation  of  RL.  When 
extracting  the  clicks,  small  segments  of  the  sonar  pings  were  also  removed,  but  the  short 
duration  of  the  clicks  compared  to  the  overall  length  of  the  pings  rendered  the  effect  on  the 
RL  calculations  negligible.  Ping  echoes  from  surface  and  bottom  bounces  were  excluded 
from  the  RL  determinations  because  their  amplitudes  were  generallv  at  least  20  dB  lower 
than  the  direct  arrivals.  All  programming  for  the  remainder  of  the  analysis  was  conducted  in 
Matlab  (The  Math  Works,  Inc.). 

The  duration  and  bandwidth  of  the  sonar  pings  changed  between  vears  (Tables  A5.1 
&  A5.2).  A  2-pole  Butterworth  band-pass  filter  was  designed  to  exclude  extraneous,  non¬ 
ping  energy  and  was  centered  at  the  mean  frequency  of  each  ping.  The  -3  dB  endpoints 
serving  as  the  filter  cut-off  frequencies  were  chosen  as  twice  the  bandwidth  flanking  the 
upper  and  lower  frequency  bounds  of  the  ping  to  avoid  filtering  away  any  ping  energy.  The 
RL  of  each  ping  was  calculated  as  the  sound  exposure  level,  or  energy  flux  densitv,  given  bv 
the  sum  of  the  squared  pressure  over  the  duration  of  the  ping  in  dB  re  1  pPa2s: 
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output  (90%  range) 


ping:  year, 
frequency 

bandwidth 

(Hz) 

duration  (s) 

SEL  (dB  re 
lpPa2s) 

SPLrms  (dB  re 
lpPa2s) 

SPLp.p  (dB  re 
l[xPa2s) 

2001,  all  frequencies 

200 

0.07,  0.12 

83, 109 

93,119 

109,133 

2002,  both 

100 

0.07,  0.13 

102, 125 

113, 135 

125, 148 

frequencies 

2003,  2.0  kHz_ 200 


2003,  3.0  kHz  400 


0.30,1.76 


112, 138 


112,  143 


131,  156 


Table  A5.2.  Bandwidth,  duration  and  level  (energy  flux  density  or  sound  exposure  level  (SEL),  sound  pressure 
level  RMS  (SPLrms),  and  sound  pressure  level  peak-peak  (SPLp.p))  of  sonar  frequencies  used  in  all  years.  The 
frequency  of  each  ping  increased  from  fo-V2btofo  +  Vi  b  where  Jo  corresponds  to  the  mean  frequency  given 
in  Table  A5.1  and  b  to  the  bandwidth  listed  here.  The  ranges  listed  here  defined  the  middle  90%  of  the  data 
values. 


RL  =  10 log  f  p2(t)dt  = 


i  rv  i _ I  1 


rT 


+  101og(T) 


where  p(t)  is  the  instantaneous  pressure  as  a  function  of  time  /,  the  duration  T  was  defined  to 
contain  90%  of  the  energy  in  a  window  surrounding  the  ping  (Blackwell  et  al.,  2004;  Madsen, 

-j  r 1  2 

2005),  and  the  term  y  I  p  ( t)dt  corresponds  to  the  squared  pressure  (RMS)  of  the  signal. 


Energy  flux  density  provided  a  realisdc  measurement  of  the  total  amount  of  acousdc  energy 
from  the  sonar  ping  impinging  on  the  tagged  animal.  Because  it  was  not  possible  to  calculate 
the  RL  of  the  pings  centered  near  the  Nyquist  frequency  of  the  tag  recording  (i.e.,  the  15 
kHz  pings  of  2001  sampled  at  32  kHz),  they  were  dropped  from  the  analysis. 

During  the  intervals  when  the  sonar  source  was  off,  the  background  noise  RL  was 
calculated.  These  measurements  were  made  by  selecdng  noise  clips  once  every  1 5s  that 
excluded  echolocadon  clicks  and  were  of  the  same  duradon  as  the  pings,  filtering  them  as 
described  above  by  cycling  through  the  2-4  unique^’s  as  though  sonar  pings  were  present 
and  compudng  the  energy7  flux  density.  Background  noise  calculations  during  creaks  and 


270 


surfacing  periods  were  made  by  averaging  the  preceding  and  succeeding  noise  RL  values. 

For  the  remainder  of  this  paper,  “sonar  RL”  refers  to  the  levels  of  the  sonar  pings  whereas 
“RL”  also  includes  the  periods  without  sonar  where  background  noise  RL  was  quantified 
instead. 

A  5. 3.1 .3  A  nimal  and  vessel  paths 

A  3D  path  of  the  tagged  animal  was  generated  from  the  DTAG  movement  and 
depth  data  (Johnson  &  Tyack,  2003).  This  path  was  geo-referenced  by  range  estimations  of 
the  tagged  individual  at  the  surface  recorded  by  visual  observers  using  binocular  reticules. 
Incorporating  a  ±0.1  reticule  reporting  inaccuracy,  error  ovals  were  calculated  around  the 
coordinate  pairs  of  each  sighting.  The  positions  of  the  animal  were  located  within  their 
respective  error  ovals  to  minimize  the  scaling  of  the  track  segments  between  surfacings.  GPS 
latitude  and  longitude  data  of  the  vessel’s  course  were  converted  to  horizontal  2D 
coordinates  positioned  at  the  water  surface.  Both  animal  and  vessel  tracks  were  re-sampled 
to  1  Hz.  Figures  A5.1  and  A5.2  illustrate  the  corrected  whale  tracks  in  2D  and  3D, 
respectively. 

A5.3.2  Data  preparation  and  statistical  methods 

A  5. 3. 2.1  The  directional  time- series 

A  direcdonal  time-series  containing  the  focal  animal’s  orientation  relative  to  the 
playback  vessel  was  expressed  as  a  single  angle  in  radians  once  each  second.  Values  of  0, 7i/2 
and  ti  corresponded  to  the  animal  facing  direedy  away  from  the  vessel,  broadside  to  the 
vessel,  and  towards  the  vessel,  respectively.  The  direcdonal  time-series  was  produced  by. 
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Figure  A5.1.  The  adjusted  animal  paths  are  displayed  as  the  heavy,  dark  black  lines.  Observer-sighted  locations 
(black  +)  with  ±0.1  reticule  error  ovals  (gray)  are  indicated.  All  sightings  (o)  were  placed  within  (or  on)  their 
associated  error  ellipses  to  minimize  the  manipulation  required  to  adjust  the  remaining  positions.  (Some  of  the 
error  ellipses  are  too  small  to  be  visible  relative  to  the  sighting  circles.)  The  beginning  of  the  whale  track  is 
indicated  (•)  and  the  first  sighting  position  are  marked  (□). 


calculating  the  arccosine  of  the  dot  product  of  the  vector  poindng  from  the  vessel  to  the 
animal  and  the  heading  vector  of  the  whale  (Figure  A5.3).  Because  we  were  concerned  with 
the  animal  poindng  towards  versus  away,  but  not  left  versus  right,  we  used  the  absolute 
value  of  the  poindng  angle. 
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Figure  A5.2.  The  3D  track  of  each  sperm  whale  (thin  black  line)  is  displayed  with  the  surface  track  of  the 
vessel  (thick  gray  line).  The  starting  points  of  the  whale  and  vessel  paths  are  indicated  (•  and  o,  respectively). 

A5.3.2.2  Assumptions 

The  time-series  data  were  determined  by  the  angular  orientation  of  the  whale  and 
position  of  the  whale  and  the  vessel.  The  regression  model  developed  below  assumed  that 
any  dependence  of  the  time-series  orientation  data  on  sonar  RL  was  not  controlled  bv  vessel 
movement  but  instead  was  driven  by  the  whale  altering  its  orientation  relative  to  the  vessel 
as  a  function  of  RL.  This  assumption  was  generally  valid  under  this  experimental  design 
since  the  vessel  was  slowly  steered  to  continue  circling  the  whale  (see  Introduction) 
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Figure  A5.3.  Scenarios  illustrating  the  computation  of  the  directional  time-series.  Scenario  1,  Animal  facing 
away  from  the  vessel:  The  vector  directed  from  the  vessel  to  the  animal  (V)  is  parallel  to  the  vector  of  the 
animal’s  heading  (W).  In  other  words,  |  v0  -  cod\  =0  and  v  -  co  ,  which  means  cos^fV'W)  =  0.  Scenario  2, 
Animal  broadside  to  the  vessel:  Oriented  broadside,  the  value  of  cov  is  always  zero.  In  addition,  because  |  v0- 
cob  |  =  7t/2,  cos_1(V*W)  =  n/2.  Scenario  3,  Animal  facing  towards  the  vessel:  V  and  W  are  anti-parallel.  Now,  |  v6 
-  co0 1  =7i  and  v  ~  -  cop  so  cos^fV’W)  =  7t. 


independent  of  changes  in  RL,  while  the  whale  could  change  orientation  in  a  matter  of 
seconds.  Other  patterns  in  the  movement  of  the  vessel  may  have  unintentionally  influenced 
RL,  however,  violating  this  critical  assumption.  For  example,  the  vessel  might  have  steered  in 
front  of  an  animal  that  was  swimming  with  a  fairly  constant  heading.  As  the  vessel  pulled  in 
front  of  the  animal’s  path  and  the  distance  between  the  vessel  and  animal  closed,  the  RL 
would  consequendy  increase  (given  a  constant  SL).  Because  we  assumed  that  an  angular 
orientation  changing  as  a  function  of  RL  was  due  to  the  whale,  it  would  have  appeared  that 
the  animal  was  orienting  more  towards  the  vessel  with  closing  range  and  a  corresponding 
increase  in  RL.  Data  characterized  by  this  phenomenon  were  excluded  from  the  final 
analysis. 


A  5. 3. 2. 3  Circular  regression  on  residual  deviations  from  path  trend 

To  determine  whether  RL  affected  the  movement  behavior  of  the  focal  animal,  a 
regression  of  the  directional  time-series  on  RL  was  conducted.  Because  the  angular  response 
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was  a  circular  variable,  ordinary  regression  analysis  was  not  appropriate.  We  used  |  6  \ , 
ranging  between  0  and  71,  as  our  orientadon  variable  because  we  were  not  interested  in 
whether  whales  turned  left  or  right  but  only  towards  or  away.  Let  6  be  the  true  orientadon  of 
the  focal  animal  reladve  to  the  vessel,  which  ranged  between  ±71.  The  basic  assumpdon 
underlying  the  analysis  was  that  the  probability  density7  funcdon  (pdf)  of  6  has  the  form: 

m  =  j  VM(0;  //(RL),  x)  +  ±  VM(0;-//(RL),  x) 

where 

VM($;  //,  x)  =  [2nl0(x)]'1  exp[*cos (6  -  //)] 

is  the  von  Mises  pdf  with  mean  //,  concentradon  parameter  *  and  the  zero-order  Bessel 
funcdon  7q(#)  .  The  von  Mises  distribudon  is  the  standard  distribudon  used  in  modeling 

circular  data  (see  Batschelet,  1981  for  a  review).  Under  the  model  in  (A5.1),  6  followed  a 
mixture  of  two  von  Mises  distribudons  with  opposite  means,  equal  scales  and  equal  mixing 
weights.  This  mixture  model  reflected  the  physical  and  behavioral  equivalence  of  an  animal 
orienung  to  the  left  or  to  the  right  of  the  vessel  by  the  same  angle.  The  mean  function 
//(RL)  was  assumed  to  have  the  form: 

a(RL)  =  £>  +  *  RL) 

where  q  is  a  constant  and g{u)~  2tan  '(//)  (Fisher  &  Lee,  1992;  Fisher,  1993).  It  is 
straightforward  to  show  that  the  corresponding  pdf  of  |  6\  is  given  bv: 

K I  6 1 )  =  [2^/oWr1  (exp[^cos(  1 6 1  -//(RL))]  +  exp[^rcos((9  +  ^(RL))]). 

The  parameter  ft  in  this  model  (see  (A5.3))  governed  the  dependence  of  |  0\  on  RI.. 
The  null  htpothesis,  H0:  /?  =  0,  that  there  was  no  such  dependence  could  be  tested  against 
the  general  alternadve  hypothesis,  H,:  (}  i1  0,  using  the  likelihood  ratio  statistic  defined  as: 


(A5.1) 


(A  5.2) 


(A5.3) 


(A5.4) 
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A  =  2(log(L,)-log(L0)) 


(A5.5) 


where  log^)  and  log^)  are  the  maximized  values  of  the  log  likelihood  under  H,  and  H0, 
respecdvely.  The  log  likelihood  for  this  model  is: 

L(gJ,x)  =  -fj\ogI0(x)  + 

n 

2l°g(exp[*cos(|0  / 1  —  A(  rl  y ))]  +  exp  [  cos  ( I  <9y  |  +//(RL  y))]) 

j= 1 

where  the  observations  consist  of  pairs  (|  6ft ,  RL;),y  =  1,2,  The  maximum  likelihood 
(ML)  estimates  of  the  parameters  q,  ft  and  x  under  H]  were  found  by  maximizing  (A5.6) 
using  a  constrained  nonlinear  optimization  routine.  The  procedure  was  identical  under  fJ0 
except  that  the  optimization  was  subject  to  the  restriction  that  ft  -  0.  In  both  cases, 
maximization  was  conducted  numerically  under  the  identifiability  constraint  //(RL)  >  0  .  In 
undertaking  this  analysis,  we  considered  the  possibility  of  a  delayed  response  to  sonar  RL, 
which  would  have  indicated  that  the  animal  was  responding  to  the  stimulus  after  a  constant 
time  delay.  We  lagged  the  values  of  sonar  RL  used  in  fitting  the  unrestricted  model  and 
compared  alternative  lags  via  the  maximized  log  likelihood  but  because  there  was  no 
evidence  of  a  delayed  response,  we  proceeded  by  using  synchronous  observations  of  |  6\  and 
RL. 

A  5. 3. 2 A  Testing  significance 

A  randomization  test  was  conducted  to  determine  whether  the  null  hypothesis  could 
be  rejected  in  favor  of  the  alternative.  To  account  for  the  serial  dependence  of  both  the  RL 
and  |  6\  data,  we  preserved  the  order  of  each  data  stream  in  the  randomization  test.  Under 
the  null  hypothesis  that  the  animal’s  angular  response  was  independent  of  RL,  the  |  8\  data 


(A5.6) 


276 


were  rotated  (with  their  order  maintained)  by  a  random  amount  relative  to  the  RLs  (also  with 
their  order  maintained)  and  a  new  maximum  log  likelihood  and  set  of  maximum  likelihood 
estimates  of  the  parameters  were  calculated  for  the  null  and  alternative  models.  The  rotation 
test  was  conducted  100  dmes  and  the  likelihood  ratio  test  was  used  as  described  above  to 
compare  the  fit  of  the  rotated  data  under  both  models.  This  test  assessed  whether  the 
introduction  of  the  parameter  /?  into  the  model,  indicating  some  dependence  of  angular 
response  on  RL,  yielded  a  significandy  better  fit  of  the  actual  data  than  under  the  null 
compared  to  the  rotated  datasets.  The  value  of  A  from  the  original,  svnchronous  data  was 
compared  to  the  distribution  of  A  computed  from  the  rotated  datasets  to  determine  the 
significance  level.  This  test  assumed  no  dependence  of  the  angular  response  on  RL  when 
calculadng  the  distribution  of  A. 

A5.4  Results 

Descriptive  measurements  and  plots  of  the  ping-bv-ping  sound  exposure  levels  (RL) 
received  at  each  whale  are  presented  for  each  controlled  exposure  in  Table  A5.1  and  Figure 
A5.4.  In  all  three  years,  the  source  level  (SL)  of  the  sonar  began  at  150  dB  RMS  re  lpPa  at 
lm,  yielding  an  RL  at  the  whale  below  the  noise  floor.  As  the  sonar  SL  was  gradually  ramped 
up,  the  target  RL  grew  each  year  with  the  mean  RL  increasing  from  100  in  2001  to  1 18  in 

2002  to  129  dB  re  lpPa2s  in  2003  (for  90%  ranges  and  other  RL  units,  see  Table  A5.2).  In 
2001,  there  were  3  exposure  periods  separated  by  pauses  exceeding  4  min,  and  the  average 
sonar  RL  for  each  exposure  period  was  91,  106,  and  101  dB  re  lpPa:s.  There  were  4  such 
periods  in  2002  with  average  RLs  of  92,  116,  119,  and  122  dB  re  lpPa^s.  The  two  intervals  in 

2003  had  average  RLs  of  129  and  136  dB  re  lpPa^s.  All  experiments  exposed  the  animals  to 
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Figure  A5.4.  Received  sound  exposure  level  data  plotted  as  energy  flux  density  (dB  re  1  piPa2s)  as  a  function  of 
time  for  2001-2003. 


increasing  average  sonar  RL  with  each  successive  exposure  period  except  2001.  In  2001  and 
2003,  lower  frequency  pings  averaged  about  1  dB  higher  than  higher  frequency  pings;  in 
2002,  the  average  difference  between  frequencies  was  less  than  1  dB. 

The  path  morphologies  of  the  animals  differed  substantially  between  the  years 
(Figures  A5.1  &  A5.2).  In  2001,  the  sperm  whale  traveled  mainly  towards  the  northwest 
while  the  ship  circled  the  whale  at  a  range  that  varied  between  1.2  and  5.2  km,  a  pattern 
confirmed  using  a  passive  sonar  system  for  acousdc  localization  of  the  animal  (Zimmer  et 
al.,  2005).  The  tagged  animal  followed  a  curved  path  in  2002  with  a  small  loop  towards  the 
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end  while  in  2003  the  sperm  whale  traced  a  more  circuitous  path  initially,  eventually  settling 
on  a  westward  travel  direction.  The  number  of  complete  dives  and  the  maximum  depth 
achieved  by  each  animal  are  listed  in  Table  A5.1.  The  resulting  directional  time-series  data 
reveal  low  frequency  trends  from  the  vessel’s  movement  about  the  animal  superimposed  on 
high  frequency  fluctuations  in  the  animal’s  heading  (Figure  A5.5). 

We  excluded  the  2001  dataset  from  subsequent  analysis  because  the  vessel  passed  in 
front  of  the  whale,  which  was  continuing  on  a  steady  course.  This  violated  the  assumption 
described  in  the  Methods  that  the  motion  of  the  vessel  did  not  cause  an  interaction  between 
RL  and  orientation  of  the  whale  with  respect  to  the  vessel.  The  whale  tracks  in  2002  and 
2003  were  more  variable  and  we  did  not  observe  the  same  kind  of  interaction  as  seen  in 
2001.  We  concluded  that  retaining  the  2002  and  2003  datasets  for  the  remainder  of  the 
analysis  was  appropriate. 

The  analysis  was  designed  to  test  whether  there  was  a  relationship  between  RL  and 
each  whale’s  orientation  with  respect  to  the  sound  source.  This  allowed  testing  for  effects 
such  as  increased  avoidance  with  increased  exposure  compared  to  a  null  hypothesis  of  no 
relationship  between  the  whale’s  orientation  and  RL.  In  both  2002  and  2003,  we  were  unable 
to  reject  the  null  hypothesis  in  favor  of  the  alternative  because  A  was  not  significant  (2002:  P 
—  0.88,  2003:  P  =  0.73).  No  visible  relationship  was  evident  between  the  angular  variable 
|  6 1  and  RL  (Figure  A5.6).  In  particular,  substantial  amounts  of  spread  characterized  the  |  6  \ 
data  both  when  the  source  was  both  on  and  off.  Similar  amounts  and  ranges  of  spread 
among  the  angular  orientation  data  were  also  found  when  comparing  the  ramp-up  periods 
and  the  span  immediately  preceding  them,  reinforcing  the  conclusion  of  no  effect.  A  non- 
uniform  distribution  of  |  6\  is  present  in  2002  with  data  mostly  absent  from  the  “towards” 
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Figure  A5.5.  Directional  time-series  data  plots.  The  ordinate  ranges  from  0  (away  from)  to  n  (towards)  as 
illustrated  in  Figure  A5.3.  Black  horizontal  lines  at  the  top  of  the  plots  correspond  to  the  intervals  during  which 
the  sonar  source  was  active.  The  low  frequency  trend  of  the  vessel’s  path  and  the  high  frequency  jitter  of  the 
animal’s  movements  are  simultaneously  visible  in  these  plots. 


condition  at  all  RLs  (Figure  A5.6).  While  it  is  possible  that  the  whale  was  responding  to  the 
quiet  ship  when  the  sonar  was  off,  we  believe  that  the  vessel’s  slow  turning  rate  as  it  traced 
one  larger  loop  followed  by  a  smaller,  shorter  loop  allowed  the  whale  to  remain  slighdy 
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Figure  A5.6.  Orientation  of  whale  with  respect  to  the  source  vessel  plotted  as  |  0\  vs.  sonar  RF  (•)  and  noise 
RL  (  ).  The  ordinate  range  is  the  same  as  in  Figure  A5.4. 


ahead  of  the  vessel  for  the  duration  of  the  experiment.  By  monitoring  the  orientation  of  the 


whale  as  a  function  of  RL  in  both  the  presence  and  absence  of  sonar,  we  can  conclude  that 


this  non-uniform  distribution  was  a  consequence  of  the  geometry  of  the  ship  circling  the 


whale  and  did  not  result  from  the  sound  exposure. 


A5.5  Discussion 

A  directional  time-series  of  angular  orientations  was  calculated  from  the  3D  path  of 
tagged  sperm  whales  relative  to  a  moving  sonar  source  vessel  in  three  separate  controlled 
exposure  experiments  (CEEs).  The  experiment  from  2001  was  discarded  from  the  full 
analysis  because  the  angular  response  variable  could  have  been  affected  bv  the  movements 
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of  the  vessel  as  well  as  by  the  animal.  We  were  unable  to  reject  the  null  hypothesis  of  no 
dependence  of  angular  orientation  on  received  sound  exposure  level  (RL)  in  2002  and  2003, 
suggesting  that  increasing  RL  over  the  range  studied  did  not  elicit  a  systematic  angular 
response  of  the  animals  towards  or  away  from  the  sonar  source.  The  two  whales  behaved 
similarly  in  response  to  rather  different  exposure  ranges:  the  minimum  and  maximum  RLs  in 
2003  were  roughly  1 6  and  24  dB  higher  than  in  2002,  respectively. 

A  substantial  benefit  of  the  analytical  approach  developed  here  is  the  ability  to  derive 
meaningful  results  from  each  whale  in  a  small  number  of  total  experiments,  each  involving 
many  serially  dependent  exposures.  Earlier  studies  that  associated  each  exposed  individual 
with  a  single  RL  (see  Introduction)  required  dozens  of  experiments  to  develop  an  adequate 
sample  size  for  testing  how  responses  varied  with  RL,  a  difficult  undertaking  given  field 
costs,  logistical  challenges,  and  regulatory  limitations.  Our  analysis,  however,  was  able  to 
evaluate  a  behavioral  response  parameter  against  a  large  range  of  exposure  levels  for  each 
individual  subject,  controlling  for  serial  dependence  in  the  dataset.  Ideally,  only  a  handful  of 
experiments  would  be  required  to  gather  enough  information  to  relate  behavioral  responses 
to  a  range  of  RLs.  However,  extrapolating  results  from  individuals  to  the  population  level 
would  require  conducting  these  tests  on  at  least  a  few  representatives  of  each  relevant  age 
and  sex  class  and  within  each  meaningful  seasonal  and  behavioral  context.  The  more  variable 
the  dose-response  curves  between  individuals,  the  larger  the  sample  size  of  subjects  that 
would  be  required  to  draw  meaningful  conclusions  and  make  informed  recommendations 
with  respect  to  the  entire  population. 

The  start  of  the  exposure  protocol  used  here  involved  steadily  increasing  the  RL 
(Figure  A5.4)  in  a  manner  consistent  with  the  operation  of  anthropogenic  acoustic  signals 
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that  slowly  approach  while  transmitting,  with  mitigation  measures  that  ramp  up  the  level  of  a 
source,  and  with  an  animal  slowly  approaching  a  stationary  sound  source.  Ramp  up 
procedures  have  commonly  relied  on  the  premise  that  animals  will  move  away  from  a  sound 
source  at  levels  well  below  the  threshold  that  causes  injury.  By  the  time  the  sound  stimulus 
nears  its  full  source  level  (SL),  the  exposed  animals  will  ideally  have  maneuvered  to  a  safer 
range  where  the  risk  of  injury  from  the  stimulus  has  been  reduced.  But  it  has  also  been 
suggested  that  ramp  up  may  make  animals  more  likely  to  habituate  to  lower  levels  of 
anthropogenic  sounds,  which  may  be  beneficial  if  it  prevents  animals  from  panicking  at 
higher  (but  non-injurious)  levels.  Alternatively,  habituation  could  be  detrimental  if  it  led  to 
lack  of  avoidance  of  harmful  stimuli.  While  ramp  up  has  become  an  established  mitigation 
protocol,  there  have  been  few  studies  on  whether  the  assumed  avoidance  behavior  or 
habituation  actually  takes  place  (but  see  Malme  et  al.,  1984;  McCauley  et  ah,  2000; 

Richardson  et  ah,  1990).  The  predictions  that  ramp  up  will  stimulate  avoidance  or  cause 
habituation  and  potentially  reduce  responsiveness  seem  contradictor,  which  highlights  the 
need  for  empirical  studies  on  its  effectiveness  in  meeting  management  goals.  The  method 
presented  here  can  be  used  to  quantify  how  any  behavioral  response  relates  to  acoustic 
dosage.  Using  orientation  as  the  response  measure,  our  analysis  can  test  the  common 
assumption  that  animals  will  demonstrate  avoidance  behavior  during  ramp  up. 

When  designing  experiments  to  test  the  effects  ot  ramp  up  versus  an  approach  with 
constant  SL  on  avoidance  behavior,  it  is  important  to  both  titrate  the  levels  at  which 
avoidance  might  happen  during  a  realistic  exposure  sequence  and  to  balance  with 
appropriate  controls.  The  RLs,  for  example,  should  be  spread  more  uniformly  between  the 
minimum  and  maximum  exposure  levels  to  avoid  concentrating  the  data  over  the  narrow 
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exposure  ranges  analyzed  here.  Although  this  could  be  achieved  by  randomizing  RL  on  a 
ping-by-ping  basis  between  a  lower  and  upper  bound,  this  proposal  is  hardly  realisdc  because 
it  eliminates  both  the  very  structure  of  the  ramp  up  and  the  animal’s  ability  to  anticipate  and 
respond  to  a  monotonically  changing  trend  in  exposure  level.  Another  possibility  involves 
the  source  circling  the  animal  to  maintain  a  constant  range  and  balancing  ramp  up  with  ramp 
down  exposures  to  cover  the  full  RL  space  by  changing  the  SL.  This  scenario  is  unrealistic 
for  most  sonars  and  seismic  surveys,  however,  because  vessels  do  not  circle  the  animals  that 
they  encounter.  Most  ships  towing  a  hydrophone,  for  example,  must  travel  in  a  straight  line 
during  data  collection,  which  causes  a  ship  to  pass  by  animals  at  varying  ranges. 

Instead,  we  recommend  introducing  each  individual  subject  to  multiple  pass  bys, 
each  involving  either  a  ramp  up  or  a  transmission  using  a  constant  SL.  The  order  of 
presentation  would  be  semi-randomized  so  that  the  first  two  exposure  regimes  would  be 
different  (i.e.,  ramp  up  and  then  constant  or  constant  and  then  ramp  up),  allowing  an 
exploration,  for  example,  of  whether  ramp  up  can  facilitate  habituation.  To  test  whether  the 
vessel’s  position  with  respect  to  the  whale  may  impact  response,  the  design  would  involve 
balanced  linear  pass  bys  involving  passing  the  vessel  in  front  of  and  behind  the  animal 
(summarized  in  Figure  A5.7).  This  setup  effectively  titrates  the  dose-response  curve  by 
allowing  full  coverage  of  the  RL  and  bearing  (from  the  vessel  to  the  animal  after  folding 
from  6  to  \6\)  spaces,  allowing  the  animal  to  select  its  orientation  as  the  vessel  moves  and 
controlling  for  possible  position  effects  to  avoid  the  problem  introduced  by  the  2001 
dataset.  Multiple  exposures  within  a  single  experiment  allow  comparisons  of  the  direction 
and  magnitude  of  responses  across  trials  to  test  for  habituation  or  sensitization  across  pass 
bys  as  well  as  within  a  single  pass  by.  This  experimental  design  therefore  explores  whether 
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Quadrant:  III,  I, 
Ramp: 


I,  III,  I,  I,  III, 

t  T  ~  t 


Figure  A5.7.  (a)  Proposed  experimental  design  for  controlled  exposure  experiments  involving  ramp  up.  Once 
the  general  heading  trend  of  the  study  animal  (thick,  dark  gray  line)  can  be  determined,  the  vessel  can  either 
close  its  distance  with  the  whale  by  passing  in  front  from  broadside  in  quadrant  1  or  move  behind  from 
broadside  in  quadrant  III  as  the  whale  swims  away  from  the  path  of  the  vessel  (thick,  dotted  black  lines).  The 
source  can  either  be  ramped  up  (f)  or  maintained  at  a  constant  level  (  ).  Each  exposure  period  therefore 

allows  four  possible  combinations  of  vessel  movement  and  source  level  trend,  (b)  A  sequence  was  determined 
using  a  semi-randomized  order;  one  sample  realization  is  listed  here.  This  design  covers  the  full  RL  space  and 
all  bearing  angles  from  the  vessel  to  the  animal  (after  folding  from  0  to  |  0\)  and  balances  vessel  approaches 
with  retreats  and  increasing  with  constant  levels.  See  text  for  further  discussion. 


behavioral  responses  are  influenced  by  exposure  level,  range  to  the  source  and  the  approach 
or  retreat  of  the  vessel.  Analyses  from  such  a  protocol  could  also  explore  how  animals 
respond  as  a  funedon  of  time  and  repeated  exposure. 

The  analyucal  approach  presented  here  garners  the  full  value  of  each  exposure  bv 
evaluaung  a  condnuous  behavioral  measure  in  relauon  to  a  full  range  of  exposure  levels.  We 
also  propose  an  experimental  design  and  analysis  to  test  whether  whales  do  avoid  sound 
sources  at  levels  below  those  that  pose  a  risk  of  injury,  similar  to  the  kinds  of  studies 
conducted  with  baleen  whales  (Malme  et  al.,  1984;  McCauley  et  al.,  2000;  Richardson  et  al., 
1990).  This  is  a  cridcal  assumpuon  used  to  jusdfy  many  midgauon  measures,  but  has  not 
been  well  tested,  especially  for  odontocetes.  As  an  applied  research  approach  used  to 
develop  and  implement  effeedve  and  safe  mitigadon  standards,  CEEs  are  a  useful  method  to 
provide  realisdc  assessments  of  how  anthropogenic  sounds  actually  impact  behaviorallv 
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complex  marine  mammals  in  a  variety  of  specific  settings,  and  to  test  whether  common 
mitigation  measures  actually  achieve  their  conservation  goals. 
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APPENDIX  6.  Transmission  Loss  Patterns  from  Acoustic 
Harassment  and  Deterrent  Devices  Do  Not  Always  Follow 
Geometrical  Spreading  Predictions* 

A6.1  Abstract 

Acoustic  harassment  and  deterrent  devices  have  become  increasinglv  popular 
mitigation  tools  for  negotiating  the  impacts  of  marine  mammals  on  fisheries.  The  rationale 
for  their  variable  effectiveness  remains  unexplained  but  high  variability*  in  the  surrounding 
acoustic  field  may  be  relevant.  In  the  present  study,  the  sound  fields  of  (me  acoustic 
harassment  device  and  three  acoustic  deterrent  devices  were  measured  at  three  studv  sites 
along  the  Scandinavian  coast.  Superimposed  onto  an  overall  trend  of  decreasing  sound 
exposure  levels  with  increasing  range  were  large  local  variations  in  sound  level  for  all  sources 
in  each  of  the  environments.  This  variability  was  likelv  caused  bv  source  directionality*,  inter¬ 
ping  source  energy  level  variation  and  multi-path  interference.  Rapid  and  unpredictable 
variations  in  the  sound  level  as  a  function  of  range  deviated  from  expectations  derived  from 
spherical  and  cylindrical  spreading  models  and  conflicted  with  the  classic  concept  of 
concentric  zones  ot  increasing  disturbance  with  decreasing  range.  Under  such  conditions, 
animals  may  encounter  difficulties  when  trying  to  determine  the  direction  to  and  location  of 
a  sound  source,  which  may  complicate  or  jeopardize  avoidance  responses. 

A6.2  Introduction 

Marine  mammals  interact  with  aquaculture  and  fisheries  in  a  variety*  of  ways.  Thev 


*  This  manuscript  has  been  submitted  to  Marine  Mammal  Science  for  publication  with  |akob  Tougaard,  Poul  Boel 
Jorgensen,  Line  A.  Kvhn,  Jeppe  Dalgaard  Balle,  Cristina  Bernardez,  Arne  Fjalling,  Junita  Karlsen  and  Magnus 
Wahlberg  as  co-authors. 
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can  consume  stocks  or  catch  directly,  inflict  harm  upon  the  catch  and  the  fishing  gear, 
introduce  fecal  coliform  bacteria  or  parasites,  and  become  severely  or  fatally  caught  in  the 
gear  (reviewed  in  Hammond  &  Fedak,  1994;  Dawson  et  al.,  1998;  Nash  et  al.,  2000).  These 
interactions  should  be  limited  both  to  protect  the  animals  and  to  reduce  the  economic  losses 
incurred  by  the  fisheries.  Acoustic  approaches  have  been  developed  to  alert  the  animals  to 
the  presence  of  gear  or  to  encourage  them  to  vacate  an  area  (see  Jefferson  &  Curry,  1996  for 
a  review).  Repeated  usage  of  an  offensive  stimulus,  however,  can  lead  to  habituation, 
sensitization,  attraction  (once  the  sound  has  been  associated  with  the  presence  of  food)  or,  if 
loud  enough,  hearing  damage.  The  use  of  gunshots,  explosives,  firecrackers  and  biological 
sounds  have  been  largely  ineffective  in  deterring  marine  mammals  from  fisheries,  possibly 
for  the  reasons  mentioned  above  (Shaughnessy  &  Semmelink,  1981;  Jefferson  &  Curry, 

1996). 

The  playback  of  artificial  sounds  intended  to  mitigate  conflicts  between  marine 
mammals  and  fisheries  have  met  with  mixed  results.  Such  playback  devices  can  be  separated 
into  two  categories.  Low  level  acoustic  deterrent  devices  (ADDs,  commonly  referred  to  as 
“pingers”)  are  designed  to  displace  animals  temporarily  from  a  region.  On  the  other  hand, 
high  level  acoustic  harassment  devices  (AHDs,  or  “seal  scarers”)  are  loud  enough  to  cause 
pain  and  discourage  predation  (e.g.,  Milewski,  2001).  ADDs  and  AHDs  differ  in  their 
output  source  energy  levels  (SLs)  and  frequency  bands.  ADDs  typically  operate  in  the  10-100 
kHz  band  and  emit  SLs  below  150  dB  re  1  pPa2s  @  1  m,  whereas  AHDs  operate  mainly 
between  5  and  30  kHz  at  levels  often  exceeding  170  dB  re  1  juPa2s  @  1  m  (Northridge  et  al., 
2006).  (See  Madsen,  2005  for  an  explanation  of  level  measurements  and  units.) 

ADDs  and  AHDs  are  currendy  used  to  mediate  many  marine  mammal-fishery 
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interactions  worldwide.  After  introducing  ADDs,  several  studies  have  documented  actual 
changes  in  the  behavior  of  harbor  porpoises  ( Phocoena phocoena ),  one  of  the  species  most  at 
risk  of  by-catch,  leading  to  a  reduction  in  entanglement  (e.g.,  Kraus  et  al.,  1997;  Trippel  et  al., 
1999)  and  in  local  abundance  (Johnston,  2002;  Olesiuk  et  al.,  2002).  More  than  half  of  the 
New  Zealand  Hector’s  dolphins  {Cephalorhynchus  hector, i)  observed  in  one  study  avoided 
“white  pinger”  ADDs  (manufactured  by  Dukane  ®,f0  =  9.6  kHz,  pulse  length  =  400  ms) 
attached  to  gillnets  (Stone  et  al.,  2000).  In  a  trial  involving  AHDs  in  the  Baltic  Sea, 
depredation  losses  of  salmon  in  traps  due  to  gray  seals  (Halicboerus  giypus)  were  halved, 
doubling  the  landed  catch  (Fjalling  et  al.,  2006).  Also,  killer  whales  ( Orcinus  orca)  were 
strongly  displaced  by  AHDs  in  a  study  conducted  in  British  Columbia  (Morton  &  Svmonds, 
2002).  As  a  result  of  these  kinds  of  findings,  ADDs  and  AHDs  have  become  increasinglv 
popular  for  abating  marine  mammal  interactions  with  fisheries  (Johnston  &  Woodlev,  1998). 
Indeed,  pingers  are  now  mandator}'  in  several  types  of  gill-net  fisheries  around  the  world  and 
have  been  suggested  as  a  possible  mitigation  solution  to  by-catch  associated  with  commercial 
trawling  (de  Haan  et  al.,  1997;  Reeves  et  al.,  2001). 

Not  all  experiments,  however,  have  encountered  this  level  of  success.  Cox  et  al. 
(2001)  reported  habituation  of  free-ranging  harbor  porpoises  to  one  Dukane  NetMark  100 
pinger  (10  kHz,  132  dB  re  IpPa  @  lm).  These  animals  partially  habituated  to  both  Airmar 
(10  kHz,  132  dB  re  lpPaRMS  @  lm)  and  SaveWave  Black  Save  pingers  (30-160  kHz,  155  dB 
re  l(j.PaRMS  @  lm)  over  a  48-day  course  involving  repeated  activation  and  deactivation  of 
these  devices  (Jorgensen,  2006).  Quick  et  al.  (2004)  reported  survey  results  indicating  that 
despite  the  elevated  usage  of  AHDs,  damage  to  Scottish  marine  salmon  firms  bv  harbor 
( Phoca  vitulina)  and  gray  seals  increased  between  1987  and  2001.  Similarly,  sea  lions  ( Otaria 
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flavescens)  damaged  catches  in  gillnets  containing  active  pingers  more  often  than  those 
without  pingers  (Bordino  et  al.,  2002).  The  by-catch  levels  of  Franciscana  dolphins 
( Pontoporia  blainvillei),  however,  did  fall  in  this  same  study  when  the  pingers  were  active.  The 
mechanisms  leading  cetaceans  and  pinnipeds  to  avoid  or  become  attracted  to  fishing 
operations  with  functional  ADDs  and  AHDs  remain  uncertain  (Kraus,  1999;  Quick  et  al., 
2004;  but  see  Akamatsu  et  al.,  1996;  Kastak  et  al.,  2005;  Kastelein  et  al.,  2006  for 
explorations  of  tolerance  and  habituation  thresholds  in  seals  and  sea  lions).  This  calls  for 
research  that  examines  how  ADDs  and  AHDs  actually  function  and  transmit  signals  into  the 
water.  Quantifying  the  sound  exposure  level  (SEL)  of  these  devices  will  yield  an  improved 
understanding  of  the  acoustic  field  to  which  animals  are  exposed  when  approaching  a  pinger 
underwater.  Simple  spherical  and  cylindrical  spreading  models  and  their  associated  zones  of 
increasing  impact  with  decreasing  range  (Richardson  et  al.,  1995)  may  not  be  applicable  for 
sound  transmission  in  every  instance  (e.g.,  DeRuiter  et  al.,  2006;  Madsen  et  al.,  2006). 
Although  Terhune  et  al.  (2002),  for  example,  depicted  that  received  levels  varied  greatly  as  a 
function  of  range  for  AHDs  in  the  Bay  of  Fundy,  Canada,  the  sound  field  of  an  ADD  in  the 
same  area  displayed  less  variability  with  range  (e.g.,  Cox  et  al.,  2001). 

The  nature  of  the  sound  field  may  be  highly  dependent  on  several  factors  including 
geographic  location,  habitat  morphology,  the  time-frequency  characteristics  of  the  emitted 
signals,  and  the  depth  of  source  and  receiver.  Shallow  water  can  lead  to  multi-path 
propagation  in  which  sound  reflected  off  of  both  the  water’s  surface  (including  associated 
wave  action)  and  the  ocean  bottom  interferes  constructively  and  destructively  to  create  a 
complicated  pattern  of  signal  intensity  as  a  function  of  range.  This  phenomenon  may  make 
it  quite  difficult  to  move  away  from  a  sound  source  by  swimming  down  an  intensity  gradient 
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in  order  to  minimize  exposure.  A  detailed  characterization  of  the  sound  fields  of  these 
devices  is  needed  to  understand  their  possible  influence  on  marine  mammal  behavior. 

In  this  study,  we  test  whether  typical  ADD  and  AHD  signals  propagate  according  to 
the  spherical  or  cylindrical  spreading  that  is  generally  assumed  when  discussing  zones  of 
increasing  impact  (Richardson  et  al.,  1995).  We  also  explore  the  issue  of  variable  SELs  at 
close  and  distant  ranges  to  several  types  of  pingers  and  a  single  AHD  in  three  shallow  water 
environments  in  Sweden  and  Denmark. 

A6.3  Materials  and  Methods 
A6.3.1  Field  sites 

Three  study  sites  were  selected  for  the  sound  transmission  experiments  (Figure 
A6.1).  The  first  was  situated  in  a  bay  south  of  the  island  of  Salto,  Sweden  (referred  to  here 
as  the  “Salto”  field  site,  58°51.7’N,  11°08.6’E).  The  bottom  of  the  bay  was  relatively 
smooth,  1 3-20  m  deep  and  was  comprised  of  a  mixture  of  mud  and  sand  patches.  Salto  was 
utilized  on  5  June  (SSs  for  Salto,  Sweden,  summer)  and  23,  24  and  29  September  2005  (SSf 
for  Salto,  Sweden,  fall).  The  second  field  site,  used  on  23,  24,  and  29  September  2005,  was 
located  in  another  bay  on  the  eastern  side  of  the  island  of  Sydkoster  (referred  to  here  as  the 
“Kosterhamn”  or  KSf  field  site,  58°52.7’N,  1 1°05.4’E).  The  sandy  seafloor  graded  smoothly 
from  a  depth  of  12  m  where  the  experiment  was  conducted  to  more  than  20  m  at  the 
entrance  of  the  deep  fjord.  The  final  site  employed  on  9  September  2005  was  located  in  the 
shallow,  sloping  waters  (5-15  m)  of  Jammerland  Bay,  Storebadt,  Denmark  (called 
“Jammerland”  or  JDf  here,  55°36.0’N,  1 1°05.1’E)  and  was  characterized  bv  a  hard,  sandv 
bottom.  These  sites  were  representative  of  locations  with  respect  to  depth,  topography,  and 
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Figure  A6.1.  Maps  of  study  locations. 


bottom  structure  where  pingers  have  been  deployed  by  the  fisheries.  For  all  sites,  sea  state 
varied  between  0  and  2  during  recordings. 
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Sound  source 

Manufacturer 

Field  site3 

Approximate 
source  level 
(dB  re  1  pPa 
RMS  @  1  m) 

Frequency 
(kHz)  * 

Signal  typeb 

Average 
duration  (ms) 

ADD 

Airmar 

SSf,  KSf 

132 

9.8 

C 

300 

ADD 

Airmar 

JDf 

132 

10 

C 

300 

ADD 

Aquamark 

SSf,  KSf 

145 

20-160 

C,S 

300 

ADD 

SaveWave 

JDf 

155 

30-120 

S< 

200-425 

AHD 

Lofitech 

SSs,  KSf 

193 

15.6 

c 

200 

a  SSs:  Salto,  Sweden,  spring;  KSf:  Kosterhamn,  Sweden,  fall;  SSf:  Salto,  Sweden,  fall;  JDf:  Jammerland, 

Denmark,  fall 

b  C:  constant  frequency;  S:  frequency  sweep 

c  The  SaveWave  pinger  produced  a  series  of  upward-modulated  frequency  sweeps,  which  were  of  variable 
duration  and  rich  in  harmonics.  The  SLs  of  these  signals  were  similar.  Sweeps  were  repeated  up  to  4  times 
per  signal.  Signals  were  repeated  with  a  variable  interval  of  up  to  several  tens  of  seconds.  All  parameters 
changed  randomly  from  one  signal  to  the  next. 

Table  A6.1.  Specifications  of  sound  sources  described  in  this  study. 


Field  site 

Hydrophone 

Recording  unit 

Sound  source 

SSs 

BK  8101 

DAT 

AHD 

SSf 

Reson  TC  4032 

Airmar 

Reson  TC  4034 

DAB 

Aquamark 

KSf 

Reson  TC  4032 

Airmar 

Reson  TC  4034 

AHD,  Aquamark 

JDf 

Reson  TC  4032 

DAB 

SaveWave,  Airmar 

Table  A6.2.  Equipment  used  at  each  field  site  with  corresponding  amplification  and  filtering  details. 
Abbreviations:  B&K  =  Briiel  and  Kjjer  (Danish  hydrophone  company),  DAT  =  Digital  Audio  Tape  Recorder, 
HP  =  high  pass  filter;  LP  =  low  pass  filter,  DAB=Data  Acquisition  Board.  SSs:  Salto,  Sweden,  spring,  KSf: 
Kosterhamn,  Sweden,  fall,  SSf:  Salto,  Sweden,  fall,  JDf:  Jammerland,  Denmark,  fall.  All  hydrophones  were 
calibrated  in  the  laboratory  before  fieldwork. 


A6.3.2  Sound  sources 

Table  A6.1  lists  the  specifications  for  the  sound  sources  and  Figure  A6.2  provides 
the  waveforms,  spectra  and  spectrograms  of  the  acousdc  output  of  each  device. 

A6.3.3  Experimental  protocol 

There  were  a  few  differences  in  how  the  data  were  gathered  and  the  setup  of  the 
recording  chain  between  the  field  sites.  Details  of  the  equipment  variability  are  listed  in 
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Figure  A6.2.  Waveforms  (left),  spectra  (center)  and  spectrograms  (right)  for  each  of  the  sound  sources.  The 
SaveWave  signal  was  an  example  taken  from  the  larger  repertoire  of  signals  (see  Table  A6.1).  Sweep  duration, 
start  and  end  frequencies  and  number  of  repetitions  changed  randomly  from  signal  to  signal. 

Table  A6.2.  The  sound  sources  were  deployed  singly  at  a  fixed  depth  either  by  suspending 
them  from  a  buoy  or  the  edge  of  a  boat  at  the  two  Swedish  sites.  Measurements  at 
Jammerland  took  place  as  part  of  a  separate  study  on  habituation  of  porpoises  to  pingers  and 
employed  a  5  x  3  array  of  15  SaveWave  pingers  spaced  200  m  apart  and  a  5  x  1 1  array  of  55 
Airmar  pingers  spaced  100  m  apart.  All  pingers  were  attached  approximately  0.5  m  below 
the  surface  at  the  end  of  buoys  measuring  2  m  in  length  (fashioned  from  bamboo  sticks 
lashed  to  a  lead  weight  and  a  Styrofoam  float).  The  two  arrays  were  separated  by  about  5 
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Sound  source 

Field  site 

Recording  duration  (min) 

Number  of  signals  measured 

Lofitech  AHD 

KSf 

54 

388 

SSs 

93 

538 

SSf 

41 

423 

Airmar  ADD 

KSf 

62 

211 

JDf 

12 

35 

Aquamark  ADD 

SSf 

KSf 

41 

62 

58 

50 

Save  Wave  ADD 

JDf 

11 

40 

Tabic  A6.3.  Recording  duration  and  number  of  signals  analyzed  for  each  sound  source  and  field  site.  See 
Table  A6.1  for  abbreviations. 

km. 

Recordings  at  all  sites  were  made  by  towing  a  previously  calibrated  hydrophone  from 
a  small  boat  that  drifted  or  was  rowed  very7  slowly  past  the  sound  source  to  cover  both 
distant  and  close  ranges.  The  Reson  TC  4032  and  BK  8101  hydrophones  had  cylindrical 
elements  and  became  directional  receivers  at  frequencies  above  20  kHz.  The  Reson  TC 
4034  had  a  spherical  element  and  was  thus  omni-directional  at  all  frequencies.  All 
hydrophones  were  calibrated  in  the  laboratory  before  experiments  commenced  to  ensure 
that  sensitivities  were  in  agreement  with  the  standards  given  by  the  producers.  For  one  set  of 
experiments  (SSs,  JDf),  the  depth  of  the  hydrophone  was  held  constant  at  2,  3  or  5  m.  For 
the  other  experiments  (SSf,  KSf),  a  Star-Oddi  CTD  tag  was  attached  10  cm  above  the 
h\rdrophone  element.  This  tag  logged  depth,  salinity  and  temperature  once  everv  second  and 
the  data  were  downloaded  at  the  end  of  each  experiment.  The  sampling  rates  for  all 
experiments  ranged  between  48  and  500  kHz  depending  on  the  recording  system  and  the 
pinger  that  was  being  characterized.  All  data  from  the  recording  unit  were  stored  on  a 
laptop  computer.  Table  A6.3  lists  the  recording  duration  and  number  of  signals  analyzed  for 
each  experiment.  A  handheld  GPS  was  used  at  the  Jammerland  field  site  to  provide  the 
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location  of  the  sound  sources.  At  the  two  other  sites,  a  frequency  shift  keying  (FSK)- 
modulated  representation  of  GPS  location  was  synchronously  recorded  to  allow  subsequent 
pairing  of  all  received  signals  with  their  absolute  locations  (see  Mohl  et  al.,  2001). 

The  SL  and  directionality  of  the  AHD  were  measured  in  a  harbor  near  the  field  site 
prior  to  the  field  experiment.  No  boat  activity  was  present  at  the  time  of  this  test.  For  the 
Airmar  and  Aquamark  pingers,  the  measurements  were  made  in  an  echo-free  tank.  The 
hydrophone  was  fixed  one  meter  from  the  transmitting  element  of  the  ADD  or  AHD  and 
the  entire  setup  was  lowered  to  depth.  To  evaluate  the  directionality  of  the  ADD  or  AHD, 
SL  was  calculated  from  several  pings  emitted  at  each  of  several  orientations  of  the  ADD  or 
AHD  relative  to  the  hydrophone. 

A6.3.4  Ping  detection 

Using  customized  Matlab  (Mathworks,  Inc.)  software,  ping  detection  was  partially 
automated  by  locating  ping  events  in  the  recording  that  exceeded  a  user-defined  amplitude 
threshold.  To  qualify  for  analysis,  a  ping  needed  to  fulfill  3  criteria.  It  had  to  1)  be  at  least 
10  dB  louder  than  an  interval  of  silence  of  the  same  duration  immediately  preceding  the 
ping,  2)  correspond  to  the  durations  listed  in  Tables  A6.1  and  A6.3)  be  confirmed  by  the 
user.  Signals  from  Jammerland  were  identified  manually  in  the  recordings  because  they  were 
characterized  by  a  poorer  signal-to-noise  ratio  (SNR)  resulting  from  the  greater  distances 
separating  the  pingers  from  the  hydrophone. 
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A6.3.5  Calculations 


A6.3.5.1  Range 

The  latitude,  longitude  and  depth  of  each  source  and  receiver  were  all  converted  into 
3D  meter  space.  At  the  Jammerland  field  site,  the  Cartesian  distance  between  the  receiver 
and  the  closest  pinger  source  was  computed  as  the  range.  For  the  two  other  sites,  the 
Cartesian  distance  was  simply  calculated  between  the  receiver  and  the  single  source. 

A6.3.5.2  Sound  Exposure  Eevel  (SEE) 

All  pings  of  constant  frequency  (see  Table  A6.1)  were  band-pass  filtered  around  their 
central  frequency  using  a  two-pole  Butterworth  filter  to  exclude  extraneous,  non-ping 
energy.  For  frequency  sweep  signals,  a  two-pole  Butterworth  band  pass  filter  was  applied 
above  and  below  the  lowest  and  highest  frequencies  contained  within  the  signal.  The 
received  acoustic  energy  of  every  ping  was  computed  as  the  energy  flux  density,  or  SEL  (for 
sound  exposure  level),  defined  as  the  logarithm  of  the  sum  of  the  squared  pressure  over  the 
ping  duration  in  dB  re  1  pPa:s: 

SEL  =  1 0 log  p2(t)dt  =  lOlog^rJ^  />2(/y/j  +  101og(T)  +  120  (1) 

where  p(/)  is  the  instantaneous  pressure  at  time  t  and  the  duration  T  of  the  signal  contains 
90%  of  the  energy  (Blackwell  et  al.,  2004;  Madsen,  2005).  A  calibration  signal  of  known 
sound  level  was  routed  through  the  entire  recording  chain  and  used  as  a  reference  for  the 
computations. 

The  SaveWave  signals  contained  energy  beyond  the  range  of  the  flat  frequency 
response  of  the  hydrophone.  To  compensate  for  this  reduced  sensitivitv,  these  signals  were 
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Figure  A6.3.  Source  energy  level  (at  one  meter  distance)  of  the  Airmar  and  Aquamark  pingers  recorded  in 
various  directions.  The  levels  of  the  CF  (constant  frequency)  and  sweep  ping  are  denoted  uniquely  (+  and  o, 
respectively).  The  orientation  scenarios  1-6  of  the  pingers  and  receivers  are  illustrated  graphically  beneath  the 
plots.  The  pinger  (black  and  white  oval)  was  recorded  from  the  direction  indicated  by  the  origin  of  the  arrow. 
The  first  pinger  was  recorded  from  its  north  pole,  the  middle  four  from  the  equator  at  four  different  pinger 
orientations  and  the  final  image  from  the  south  pole. 


adjusted  by  amplifying  the  high  frequencies  in  this  range.  At  the  greatest  distances  where  the 
SNR  was  poor,  the  SELs  from  the  SaveWave  were  calculated  once  the  energy  of  the 
background  noise  immediately  preceding  the  signal  was  subtracted.  Airmar  recordings  from 
Jammerland  were  similarly  characterized  by  a  poor  SNR  at  large  distances.  These  ping  levels 
were  therefore  determined  by  the  peak  of  the  average  power  spectrum  calculated  over  the 
complete  signal  duration. 
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Figure  A6.4.  Received  sound  exposure  level  as  a  function  of  range.  Slopes  obeying  cylindrical  and  spherical 
spreading  laws  and  absorption  are  shown  by  the  dotted  and  solid  lines,  respectively. 


A6.4  Results 

Figure  A6.3  displays  the  SL  measurements  of  the  Airmar  and  Aquamark  in  different 
direcdons,  revealing  anomalies  of  up  to  4.7  and  25.7  dB,  respectively.  Figure  A6.4  plots  SEL 
as  a  function  of  range  for  all  sound  sources  in  each  environment.  The  lines  indicating 
spherical  and  cylindrical  spreading  are  not  intended  to  compare  the  expected  and  actual 
SELs  but  rather  to  show  patterns  of  the  slope  predicted  by  these  basic  models.  Figure  A6.4 
illustrates  that  despite  an  overall  trend  for  SEL  to  decrease  with  increasing  distance,  a 
tremendous  amount  of  dynamic  range  in  the  SEL  existed  over  a  given  range.  This 
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Figure  A6.5.  Received  sound  exposure  level  from  a  Lofitech  AHD  source  as  a  function  of  range  for  a 
recording  using  a  hydrophone  that  continuously  approached  a  stationary  pinger.  Imagining  an  animal  moving 
along  a  trackline  similar  to  the  one  here,  a  steadily  reliable  decrease  with  increasing  range  would  not  occur  since 
the  levels  fluctuate  dramatically.  See  text  for  further  elaboration. 


phenomenon  appeared  consistently  in  the  plots  for  all  of  the  sound  sources  and 
environments. 

The  upper  left  subpanel  of  Figure  A6.4  is  enlarged  in  Figure  A6.5  to  show  that 
fluctuations  in  SEL  at  a  particular  range  were  often  much  greater  than  those  between  two 
rather  different  ranges.  Figure  A6.5  can  also  be  viewed  as  the  series  of  SELs  that  an  animal 
would  encounter  if  it  were  traveling  direcdy  towards  or  away  from  the  AHD  Lofitech 
source.  An  animal  traveling  away  from  the  AHD  would  experience  a  constandy  fluctuadng 
SEL,  generally  trending  downwards,  but  with  successive  pings  in  the  sequence  increasing 
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and  decreasing  unpredictablv. 


A6.5  Discussion 

There  was  a  pronounced  variability  in  SELs  of  up  to  19  dB  at  constant  ranges  out  to 
beyond  1  km  from  the  AHD  (Lofitech).  For  the  ADDs  (/.&,  the  Airmar,  Aquamark  and 
SaveWave  pingers),  the  variability7  was  less  pronounced  at  long  ranges.  At  a  range  of  100  m, 
there  was  up  to  10  dB  of  variation  for  the  Airmar  pinger  and  up  to  6  dB  for  the  Aquamark 
100  (Figure  A6.4).  The  overall  trend  of  decreasing  SEL  with  increasing  range  from  the 
ADD  or  AHD  (Figures  A6.4  &  A6.5)  was  disrupted  by  interference  patterns.  Such  variability 
and  deviation  from  spherical  or  cylindrical  spreading  expectations,  even  at  large  distances 
trom  the  source,  conflicts  with  the  classic  description  of  concentric  zones  of  increasing 
disturbance  with  decreasing  range  (Richardson  et  al.,  1995).  This  also  poses  a  difficult}7  for 
an  animal  attempting  to  predict  level  on  a  fine  scale  and  orient  with  respect  to  this  variable 
intensity  gradient.  The  spatial  extent  of  these  zones  is  clearly  difficult  to  predict,  especiallv 
given  the  plasticity  of  an  animal’s  thresholds  of  detection,  injurv  and  avoidance  resulting 
trom  its  motivation,  behavior  and  physiological  state. 

One  of  the  motivating  concerns  for  launching  this  study  was  the  possibility  that 
constructive  interference  could  generate  unpredictable  pinger  SEL  hotspots  of  sufficiently 
high  intensity  that  might  lead  to  unexpected  hearing  damage  in  marine  mammals.  Although 
the  recorded  levels  fell  below  the  intensities  that  caused  temporary  threshold  shifts  and 
temporary  losses  of  hearing  sensitivity  (i.e.,  195  dB  re  lpPa2s,  Finneran  et  al.,  2005),  Figures 
A6.4-A6.5  reveal  that  moving  away  from  the  source  did  not  necessarily  guarantee  that  SEL 
would  decrease.  This  alters  the  way  in  which  we  should  understand  an  animal’s  perception 
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of  an  AHD-  or  pinger-emitted  sound  field.  While  swimming  away  from  a  sound  source,  the 
animal  could  be  exposed  to  dramatic  sound  level  variations  over  very  small  spatial  scales. 
Theoretically,  the  sound  level  may  shift  by  several  orders  of  magnitude  within  a  fraction  of  a 
meter  (Wahlberg,  2006).  If  the  animal  integrates  time  of  arrival  and  phase  shift  differences 
between  its  ears  with  a  series  of  level  cues  and  these  two  sets  of  sensory  cues  oppose  one 
another,  it  may  be  difficult  to  determine  the  direction  to  and  location  of  the  sound  source. 
Natural  orientation  cues  may  also  be  obscured  by  artificial  signals  through  masking  and  from 
temporary  threshold  shifts  reported  to  occur  at  levels  below  those  measured  here  (Schlundt 
et  al.,  2000).  This  possibility  conflicts  with  the  hypothesis  that  animals  learn  to  avoid  an  area 
due  to  an  acoustic  deterrent.  The  rapid  and  unpredictable  variations  in  the  sound  intensity 
as  a  function  of  range  to  the  pinger  may  seriously  confuse  the  animal  and  make  avoidance 
responses  more  complicated  than  intended.  If  the  animal  uses  subsequent  pings  to  improve 
its  ability  to  assess  directionality  of  a  signal  (as  indicated  by  Kastelein  et  al.,  2007),  this 
problem  becomes  more  serious. 

We  still  need  to  test  whether  large  spatial  variations  in  SELs  prevent  animals  from 
reacting  appropriately  to  ADD  and  AHD  signals.  Besides  the  actual  problem  of  detection 
and  determination  of  the  direction  to  the  sound  source,  the  behavior  of  the  animals  may  be 
influenced  by  a  learning  component  that  needs  to  be  addressed.  Grey  seals  lifted  their  heads 
out  of  the  water  in  response  to  AHD  signals  (Bordino  et  al.,  2002;  Fjalling  et  al.,  2006)  and 
physiological  (Clark,  1991),  behavioral  (Olesiuk  et  al.,  2002)  and  masking  (Southall  et  al., 
2000)  effects  have  been  observed.  Further  studies  between  acoustic  deterrents  and  marine 
mammal  responses  are  required  to  examine  how  animals  behave  around  and  react  to  fishing 
nets  with  and  without  pingers.  These  issues  could  be  addressed  by  comparing  the  acoustic 
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measurements  of  the  pinger  signals  reported  here  with  the  behavior  of  animals  swimming 
through  the  sound  field. 

The  variability  in  the  SEL  may  be  an  important  factor  to  consider  when  evaluating 
the  implementation  of  acoustic  mitigation  devices  in  fishery  regimes.  The  dynamic 
characteristics  of  a  trawl,  for  example,  could  influence  the  source  directionalitv  and  multi- 
path  interference,  potentially  contributing  to  even  larger  SEL  fluctuations  than  observed 
under  static  conditions.  Some  newly  developed  acoustic  mitigation  devices  (/. e. ,  DDD02F) 
operate  with  SLs  higher  than  160  dB  re  1  pPa2s,  further  contributing  to  concerns 
surrounding  their  implementation  (Dalgaard  Balle  &  Larsen,  unpublished  data). 

The  variability  in  SELs  observed  in  this  study  could  have  been  caused  by  a 
combination  of  inter-ping  SL  variations,  bathymetry,  wave  action  influencing  the  surface 
reflections,  multi-path  interference,  and  source  directionality.  Salinity  and  temperature 
effects  were  unlikely  to  have  played  a  strong  role  because  neither  a  pronounced  halocline  nor 
thermocline  was  observed  (measured  at  SSf  and  KSf  with  the  Star-Oddi  CTD  tag)  and 
because  computer  modeling  has  demonstrated  that  such  an  influence  would  be  rather  small 
for  the  ranges  of  interest  here  (Westerberg  &  Spiesberger,  2002).  Source  directionality  and 
multi-path  propagation  will  now  be  explored  more  explicitly.  The  pingers  were  mounted 
vertically  to  record  signals  from  the  broadside  axis  and  thereby  minimize  directionalitv 
effects.  The  Airmar  pinger  showed  sub-dB  variations  in  its  inter-ping  SL  when  recorded  in  a 
fixed  direction,  whereas  the  Aquamark  100  showed  a  larger  variation,  possibly  because  of 
slight  variations  in  SL  for  the  various  sound  types  emitted  (Figure  A6.3).  The  broadside  SL 
of  the  Airmar  pinger  varied  less  than  2  dB  when  rotating  the  pinger  about  its  axis  (Figure 
A6.3).  Therefore,  because  the  Airmar  pingers  were  recorded  at  small  angles  relative  to  their 
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axis  of  symmetry,  most  of  the  variability  in  their  SELs  as  a  function  of  range  was  attributed 
to  multi-path  propagation.  Multi-path  modeling  demonstrates  that  variability  of  the 
magnitude  observed  here  can  result  from  the  interference  of  direct,  surface-reflected  and 
bottom-reflected  rays  (Wahlberg,  2006). 

For  the  Aquamark  pinger,  the  transmission  beam  pattern  was  more  complicated  and 
variable  and  depended  on  which  of  the  two  types  of  signals  was  being  emitted  (Figure  A6.3). 
The  SL  was  not  only  variable  between  the  pinger’ s  axis  of  symmetry  and  broadside,  but  it 
also  varied  by  13  dB  on  the  broadside  when  rotated  about  its  axis  of  symmetry.  It  was  not 
clear  to  what  extent  the  source  directionality  and  multi-path  variation  each  contributed  to  the 
SEL  variation  for  the  Aquamark  pinger.  The  signals  produced  by  the  SaveWave  pingers 
were  variable  in  duradon  and  frequency  spectrum,  causing  the  transmitted  energy7  to  vary7 
from  one  signal  to  the  next,  which  may  at  least  pardally  explain  the  observed  SEL  variability. 

The  soft  and  hard  bottom  locations  did  not  produce  clear  differences  in  the  SEL 
variability.  This  is  surprising  since  a  softer  bottom  should  have  rendered  fewer  multi-paths, 
leading  to  a  less  complicated  SEL  pattern  as  a  function  of  range.  The  soft  bottom  may  have 
reflected  sound  better  than  expected,  diminishing  the  differences  in  acoustic  propagation 
between  the  experimental  sites.  In  addition,  the  soft  bottom  site  was  shallower  than  the 
hard  bottom  site,  which  may  have  confounded  the  possible  effects  of  bottom  properties  on 
multi-path  propagation. 

The  efficiency  of  pingers,  quantified  both  in  terms  of  their  power  demands  and  the 
quantity  of  sound  that  they  are  able  to  discharge,  may  be  improved  by  decreasing  the 
duration  of  the  emitted  signal,  which  would  lead  to  a  reduction  in  the  interference  patterns 
measured  here.  This  suggestion  must  be  balanced,  however,  with  the  important  issue  that  to 
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obtain  a  maximum  effect,  the  signal  loudness  should  exceed  some  critical  threshold  for  an 
animal’s  particular  integration  time  that  will  produce  the  desired  avoidance  or  disturbance 
response.  More  work  is  required  to  explore  the  behavior  of  seals  and  porpoises  in  relation 
to  ADD  and  AHD  sound  sources  with  realistic  SLs  and  their  interaction  with  fishing  gear  in 
light  of  more  complex,  non-geometrical  spreading  models.  The  interplay  between 
conservation  and  marine  mammal  and  fishery  interactions  must  continue  to  be  engaged  by 
consistent  research  efforts  that  explore  the  ways  in  which  these  ADDs  and  AHDs  actually 
operate  and  influence  the  animals  that  they  are  intended  to  target. 

In  conclusion,  we  found  that  signals  from  ADDs  and  AHDs  did  not  propagate  in  a 
coastal  environment  according  to  the  simple  models  of  spherical  or  cylindrical  spreading  that 
posit  zones  of  increasing  impact  with  decreasing  range  (Richardson  et  al.,  1995).  The 
acoustic  field  to  which  animals  are  exposed  when  approaching  a  pinger  underwater  is  thus 
complicated  and  not  easily  described  by  these  concentric  zones  of  responsiveness,  masking 
and  discomfort  relative  to  the  range  from  the  ADD/AHD.  Instead,  the  SEL  varied  several¬ 
fold  within  very  short  distances,  likely  as  a  result  of  the  interference  of  direct,  surface- 
reflected  and  bottom-reflected  rays  (Wahlberg,  2006).  The  behavior  of  seals  and  cetaceans 
in  relation  to  the  sound  field  of  ADDs  and  AHDs  should  be  prioritized  in  future  research. 
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These  animals  carousel  feed,  corralling  herring  into  a  ball  before  tail  slapping,  incapacitating  and  eating  the  fish.  Periods  of  tail 
slapping  were  characterized  by  elevated  movement  variability,  heightened  vocal  activity  and  call  types  containing  orientation 
cues.  Two  types  of  behavioral  sequence  preceded  the  tight  circling  of  carousel  feeding.  First,  the  animals  swam  directionally  and 
in  2  of  3  instances  were  silent,  suggesting  they  may  have  been  eavesdropping.  Second,  the  animals  moved  in  broad  horizontal 
loops  and  all  4  occasions  were  accompanied  by  vocal  activity,  indicating  that  this  and  tail  slapping  may  benefit  from  social 
communication.  Using  human  speech  processing  techniques,  I  considered  whether  calls  are  comprised  of  shared  segments  that 
can  be  recombined  to  form  the  stereotyped  and  variable  repertoire.  These  shared  segments  performed  as  well  as  the  traditional 
whole  call  types  in  a  classification  experiment  and  required  less  information  to  parse  the  vocalizations. 
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